Abstract

Our aim is to provide a short and self-contained synthesis which generalises and unifies various
related and unrelated works involving what we call $\Phi$-Sobolev functional inequalities. Such inequalities,
related to $\Phi$-entropies, can be seen in particular as an inclusive interpolation between Poincaré and
Gross logarithmic Sobolev inequalities. In addition to the known material, extensions are provided
and improvements are given for some aspects. Stability by tensor products, convolution, and bounded
perturbations is addressed. We show that under simple convexity assumptions on $\Phi$, such inequalities
hold in a lot of situations, including hypercontractive diffusions, uniformly strictly log-concave
measures, Wiener measure (paths space of Brownian Motion on Riemannian manifolds) and generic
Poisson space (which includes paths space of some pure jump Lévy processes and related infinitely divisible
laws). Proofs are simple and rely essentially on convexity. We end with a short parallel inspired by
the analogy with the Boltzmann-Shannon entropy appearing in the kinetic theory of gases and in information theory.
1 Introduction


Let $\Phi : \mathcal{I} \to \mathbb{R}$ be a smooth convex function defined on a closed interval $\mathcal{I}$ of $\mathbb{R}$, not necessarily
bounded. Let $\mu$ be a positive measure on a Borel space $(\Omega, \mathcal{F})$. The $\Phi$-entropy functional $\mathbf{Ent}^\Phi_\mu$ is defined
on the set of $\mu$-integrable functions $f : (\Omega, \mathcal{F}) \to (\mathcal{I}, \mathcal{B}(\mathcal{I}))$ by the following formula:

\[ \mathbf{Ent}^\Phi_\mu(f) := \int_\Omega \Phi(f)\,d\mu - \Phi\Big(\int_\Omega f\,d\mu\Big). \]

Obviously, such a formula makes sense only when $\int_\Omega f\,d\mu \in \mathcal{I}$, which is always the case when $\mu$ is a
probability measure. Unless otherwise stated, the $\Phi$-entropy in the sequel will always be considered for
probability measures. One has then that

\[ \mathbf{Ent}^\Phi_\mu(f) = \mathbf{E}_\mu(\Phi(f)) - \Phi(\mathbf{E}_\mu f). \qquad (1) \]

In addition, depending on $\mathcal{I}$, the integrability condition on $f$ can be relaxed, and Jensen inequality implies
that the $\Phi$-entropy functional takes its values in $\mathbb{R}_+ \cup \{+\infty\}$. Moreover, it is convex with respect to its
functional argument at fixed mean. As we will see, the global convexity requires more assumptions on $\Phi$.
If $f$ is $\mu$-a.s. constant, $\mathbf{Ent}^\Phi_\mu(f)$ vanishes, and the converse is true when $\Phi$ is strictly convex. Sometimes,
we will drop the $\mu$ subscript in $\mathbf{Ent}^\Phi_\mu$. For any random variable $X : (\Omega', \mathcal{F}') \to (\Omega, \mathcal{F})$ with law $\mathcal{L}(X)$, we will denote

\[ \mathbf{Ent}^\Phi(f(X)) := \mathbf{Ent}^\Phi_{\mathcal{L}(X)}(f). \]

The classical variance and entropy can be recovered since we have

\[ \mathbf{Ent}^{x \mapsto x^2}_\mu = \mathbf{Var}_\mu \quad\text{and}\quad \mathbf{Ent}^{x \mapsto x\log x}_\mu = \mathbf{Ent}_\mu. \qquad (2) \]

Notice that the $\Phi$-entropy functional $f \mapsto \mathbf{Ent}^\Phi_\mu(f)$ is neither homogeneous nor translation invariant in
general. Nevertheless, since it is non-negative and vanishes when its argument $f$ is a constant function,
it is a good candidate for the left hand side of a Sobolev like functional inequality where the right hand
side is a Dirichlet form.
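The definitions (1) and (2) can be checked by hand on a finite probability space. The sketch below is only an illustration: the helper name `phi_entropy`, the three-point measure `mu` and the function `f` are assumptions that do not come from the text.

```python
import numpy as np

def phi_entropy(phi, f, mu):
    """Ent^Phi_mu(f) = E_mu(Phi(f)) - Phi(E_mu f) on a finite probability space."""
    f = np.asarray(f, dtype=float)
    mu = np.asarray(mu, dtype=float)
    return float(np.dot(mu, phi(f)) - phi(np.dot(mu, f)))

def square(x):
    return x ** 2

def x_log_x(x):
    return x * np.log(x)

# Hypothetical three-point probability space and positive function.
mu = np.array([0.2, 0.5, 0.3])
f = np.array([1.0, 2.0, 4.0])

# Classical variance and entropy, computed directly for comparison with (2).
mean = np.dot(mu, f)
variance = float(np.dot(mu, (f - mean) ** 2))
entropy = float(np.dot(mu, f * np.log(f)) - mean * np.log(mean))

ent_square = phi_entropy(square, f, mu)      # should match the variance
ent_xlogx = phi_entropy(x_log_x, f, mu)      # should match the entropy
ent_constant = phi_entropy(x_log_x, np.full(3, 2.5), mu)  # vanishes on constants
```

The same helper also shows the lack of homogeneity: for $\Phi(x) = x\log x$ the value at `2 * f` is twice the value at `f`, whereas for a generic convex $\Phi$ no such scaling holds.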

Actually, the term “$\Phi$-entropy” is quite arbitrarily chosen, since we can speak about “$\Phi$-variance”
too, but this term is perhaps more adapted to the quantity

\[ \mathbf{Var}^\Phi_\mu(f) := \mathbf{E}_\mu(\Phi(f - \mathbf{E}_\mu f)), \qquad (3) \]

which is translation invariant and gives the classical variance $\mathbf{Var}_\mu$ when $\Phi(x) = x^2$, but from which the entropy
$\mathbf{Ent}_\mu$ cannot be recovered. One can remember the famous quotation from John von Neumann about
entropy, which can be found for example in [ABC+00, Chap. 10]. Notice that similar $\Phi$-entropies appear
under that name, in slightly different forms, in a lot of papers related to Information Theory and Convex
Analysis, see for example [BR82b], [BR82a], [BR82c], [TV93] and [BTT86] and references therein.
The $\Phi$-entropy is related to the so called $(h, \Phi)$-entropies, see for example [MMPS97] and references
therein. The $\Phi$-entropy is also known as J-divergence (J stands for Jensen). See also [Csi63, Csi72] for
the similar notion of $\phi$-divergence.

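Let $(X_t)_{t \geq 0}$ be a Markov process on a Polish space $\Omega$ equipped with its Borel $\sigma$-field, say for example
$(\mathbb{R}^d, \mathcal{B}(\mathbb{R}^d))$. We define the classical associated Markov semi-group $(\mathbf{P}_t)_{t \geq 0}$ acting on $\mathcal{C}_b(\Omega, \mathbb{R})$ by:

\[ \mathbf{P}_t(f)(x) := \mathbf{E}(f(X_t) \,|\, X_0 = x). \qquad (4) \]

Let us assume that there exists an invariant measure $\mu$, i.e. a positive Borel measure $\mu$ on $\Omega$ stable by
$(\mathbf{P}_t)_{t \geq 0}$. When $\mu$ is a probability, we get that $\mathcal{L}(X_0) = \mu$ implies $\mathcal{L}(X_t) = \mu$ for all $t \in \mathbb{R}_+$. We denote by
$\mathbf{L} := \partial_{t=0^+} \mathbf{P}_t$ the infinitesimal generator of $(\mathbf{P}_t)_{t \geq 0}$ with domain $\mathcal{D}(\mathbf{L}) \subset \mathrm{L}^2(\mu)$, and by $\Gamma$ the associated
“carré du champ” operator defined by:

\[ \Gamma(g, h) := \frac{1}{2}\big(\mathbf{L}(gh) - g\,\mathbf{L}h - h\,\mathbf{L}g\big). \qquad (5) \]

By convention, $\Gamma g := \Gamma(g, g)$. For Markov processes, $\Gamma g$ is always non-negative. The invariance of a
probability measure $\mu$ is equivalent to

\[ \forall f \in \mathcal{D}(\mathbf{L}), \quad \mathbf{E}_\mu(\mathbf{L}f) = 0. \]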
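We say that a positive measure $\mu$ is symmetric if and only if $\mathbf{L}$ is symmetric in $\mathrm{L}^2(\mu)$, i.e.
$\langle f, \mathbf{L}g \rangle_{\mathrm{L}^2(\mu)} = \langle g, \mathbf{L}f \rangle_{\mathrm{L}^2(\mu)}$ for all $f$ and $g$ in $\mathcal{D}(\mathbf{L})$. A measure $\mu$ is invariant if it is symmetric, but the converse is false
in general. Symmetric measures lead to an integration by parts formula

\[ -\langle g, \mathbf{L}h \rangle_{\mathrm{L}^2(\mu)} = \mathbf{E}_\mu(\Gamma(g, h)) = \mathbf{E}_\mu(\Gamma(h, g)). \qquad (6) \]

The reader may find an introduction to the analysis of Markov semi-groups in [Bak94] and [Bak02] for
example, where the delicate problem of the existence of an algebra of functions $\mathcal{A}$ stable by semi-group
and generator is addressed. By invariance of $\mu$ and Jensen inequality for $\Phi$:

\[ \begin{aligned}
\mathbf{Ent}^\Phi_\mu(\mathbf{P}_t f) &= \mathbf{E}_\mu(\Phi(\mathbf{P}_t f)) - \Phi(\mathbf{E}_\mu f) \\
&\leq \mathbf{E}_\mu(\mathbf{P}_t \Phi(f)) - \Phi(\mathbf{E}_\mu f) \\
&= \mathbf{Ent}^\Phi_\mu(f).
\end{aligned} \]

On the other hand, if the semi-group is $\mathrm{L}^2$-ergodic, $\mathbf{Ent}^\Phi_\mu(\mathbf{P}_t f)$ converges to 0 when $t$ tends to $+\infty$, and
we get that:

\[ 0 = \mathbf{Ent}^\Phi_\mu(\mathbf{P}_\infty f) \leq \mathbf{Ent}^\Phi_\mu(\mathbf{P}_t f) \leq \mathbf{Ent}^\Phi_\mu(\mathbf{P}_0 f) = \mathbf{Ent}^\Phi_\mu(f). \]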

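Actually, one can show that any $\Phi$-entropy related to the invariant measure of a Markov process is
non-increasing along the associated Markovian semi-group:

Proposition 1 (DeBruijn like property for Markov semi-groups). Let $(X_t)_{t \geq 0}$ be a Markov process
on a Polish space $\Omega$ equipped with its Borel $\sigma$-field. Let $(\mathbf{P}_t)_{t \geq 0}$ be the associated Markov semi-group with
infinitesimal generator $\mathbf{L}$ and “carré du champ” $\Gamma$. Assume that $\mu$ is an invariant probability measure.
Then, for any suitable function $f : \Omega \to \mathcal{I}$ and any $t > 0$:

\[ \partial_t \mathbf{Ent}^\Phi_\mu(\mathbf{P}_t f) = \mathbf{E}_\mu(\Phi'(\mathbf{P}_t f)\, \mathbf{L}\mathbf{P}_t f) \leq 0. \qquad (7) \]

When $\mu$ is symmetric, one has the following formulation:

\[ \partial_t \mathbf{Ent}^\Phi_\mu(\mathbf{P}_t f) = -\mathbf{E}_\mu(\Gamma(\Phi'(\mathbf{P}_t f), \mathbf{P}_t f)). \qquad (8) \]

Moreover, when $(X_t)_{t \geq 0}$ is a diffusion process, one has:

\[ \partial_t \mathbf{Ent}^\Phi_\mu(\mathbf{P}_t f) = -\mathbf{E}_\mu(\Phi''(\mathbf{P}_t f)\, \Gamma \mathbf{P}_t f). \qquad (9) \]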

Proof. Equality in (7) follows immediately from the definition of $\mathbf{Ent}^\Phi_\mu$ and $\mathbf{L}$. The symmetric case (8) comes
by integration by parts (6). Let us show that the right hand side of (7) is $\leq 0$. Jensen inequality for the
convex function $\Phi$ and the probability measure $\mathbf{P}_t$ yields $\Phi(\mathbf{P}_t(g)) \leq \mathbf{P}_t(\Phi(g))$ for any $t > 0$. Since both sides
coincide at $t = 0$, comparing their derivatives at $t = 0^+$ gives
$\Phi'(g)\, \mathbf{L}g \leq \mathbf{L}\Phi(g)$. Thus, by invariance of $\mu$, we get that $\mathbf{E}_\mu(\Phi'(g)\, \mathbf{L}g) \leq \mathbf{E}_\mu(\mathbf{L}\Phi(g)) = 0$, which gives the result when
$g = \mathbf{P}_t f$. Finally, the diffusion case (9) comes from the fact that we have then the so called “chain rule
formula”

\[ \Gamma(\alpha(g), h) = \alpha'(g)\, \Gamma(g, h). \qquad (10) \]

Recall that the operator $\mathbf{L}$ is a diffusion operator if and only if for any $f_1, \ldots, f_k$ in $\mathcal{D}(\mathbf{L})$ and any smooth
function $\alpha : \mathbb{R}^k \to \mathbb{R}$ such that $\alpha(f_1, \ldots, f_k) \in \mathcal{D}(\mathbf{L})$:

\[ \mathbf{L}(\alpha(f_1, \ldots, f_k)) = \sum_{i=1}^k (\partial_i \alpha)(f_1, \ldots, f_k)\, \mathbf{L}f_i + \sum_{i,j=1}^k (\partial^2_{ij} \alpha)(f_1, \ldots, f_k)\, \Gamma(f_i, f_j). \qquad (11) \]

We have to mention that the diffusion property really makes sense only in continuous space settings and
implies roughly that $\mathbf{L}$ is a second order linear partial differential operator without constant part, cf.
[Bak02, Bak94]. One can observe that the convexity of $\Phi$ is needed only in order to give the sign
in (7): the $\Phi$-entropy $\mathbf{Ent}^\Phi_\mu$ is non-increasing along the Markov semi-group when $\Phi$ is convex.

Finally, notice that the term $-\Phi(\mathbf{E}_\mu f)$ in the definition of $\mathbf{Ent}^\Phi_\mu(f)$ plays no role in the non-increasing
property along the semi-group since by invariance of $\mu$, one has $-\Phi(\mathbf{E}_\mu \mathbf{P}_t(f)) = -\Phi(\mathbf{E}_\mu f)$. Therefore,
one can investigate the non-increasing property along the semi-group for generic $(h, \Phi)$-entropies defined
by $h(\mathbf{E}_\mu(\Phi(f)))$. □
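The monotonicity (7) can be observed numerically on a finite state space, where the semi-group is a matrix exponential. The following is a minimal sketch under stated assumptions: the 3-state generator `L`, the function `f` and the helper names are hypothetical, and the exponential is computed by a truncated Taylor series that is adequate only for such small matrices.

```python
import numpy as np

def phi_entropy(phi, f, mu):
    """Ent^Phi_mu(f) = E_mu(Phi(f)) - Phi(E_mu f) on a finite state space."""
    return float(mu @ phi(f) - phi(mu @ f))

def semigroup(L, t, terms=80):
    """P_t = exp(tL), truncated Taylor series (fine for small t * ||L||)."""
    P = np.eye(L.shape[0])
    term = np.eye(L.shape[0])
    for n in range(1, terms):
        term = term @ (t * L) / n
        P = P + term
    return P

# Hypothetical generator: non-negative off-diagonal rates, rows summing to zero.
L = np.array([[-1.0, 0.7, 0.3],
              [0.4, -0.9, 0.5],
              [0.2, 0.6, -0.8]])

# Invariant probability measure: left null vector of L, normalised to sum 1.
eigenvalues, eigenvectors = np.linalg.eig(L.T)
mu = np.real(eigenvectors[:, np.argmin(np.abs(eigenvalues))])
mu = mu / mu.sum()

phi = lambda x: x * np.log(x)
dphi = lambda x: np.log(x) + 1.0

f = np.array([0.5, 2.0, 3.5])
times = np.linspace(0.0, 3.0, 31)
values = np.array([phi_entropy(phi, semigroup(L, t) @ f, mu) for t in times])

# Right hand side of (7) at t = 0: E_mu(Phi'(f) L f), expected to be <= 0.
derivative_at_0 = float(mu @ (dphi(f) * (L @ f)))
```

Note that this chain is not reversible, so only the invariance of `mu` is used, in line with the statement of (7).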

Property (7) tells us that any $\Phi$-entropy related to the invariant measure of a Markov process is
non-increasing along the Markovian semi-group. Actually, an exponential decrease of the $\Phi$-entropy along the
semi-group is equivalent to a functional inequality for $\mu$:

Corollary 2 (Exponential decrease of $\mathbf{Ent}^\Phi_\mu$ along a semi-group). There is an equivalence between:

\[ \exists c \in \mathbb{R}_+^*, \quad \mathbf{Ent}^\Phi_\mu(f) \leq -c\, \mathbf{E}_\mu(\Phi'(f)\, \mathbf{L}f), \qquad (12) \]

and

\[ \exists c \in \mathbb{R}_+^*, \ \forall t \geq 0, \quad \mathbf{Ent}^\Phi_\mu(\mathbf{P}_t f) \leq e^{-t/c}\, \mathbf{Ent}^\Phi_\mu(f). \qquad (13) \]

Proof. Obvious from the DeBruijn like property stated in Proposition 1 by taking the derivative in $t$. □

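Provided that $\mathcal{I} = \mathbb{R}$, one can adapt Corollary 2 to the “$\Phi$-variance” given by (3). For Markov
processes, the functional $\mathbf{J}^\Phi_\mu$ defined by:

\[ \mathbf{J}^\Phi_\mu(f) := -\int_\Omega \Phi'(f)\, \mathbf{L}f\, d\mu \qquad (14) \]

can be seen as a generalisation of Fisher information, which corresponds to the cases $\Phi(x) = x \log x$ or
$\Phi(x) = x^2$ (both are equivalent for diffusions due to the chain rule (10)). More generally, when $\mathcal{I} = \mathbb{R}_+$,
one can define the Kullback-Leibler $\Phi$-entropy (or relative $\Phi$-entropy) and the $\Phi$-Fisher functionals for
any couple of positive measures $\mu$ and $\nu$ by:

\[ \mathbf{Ent}^\Phi(\nu \,|\, \mu) := \begin{cases} \displaystyle\int_\Omega \bar\Phi\Big(\frac{d\nu}{d\mu}\Big)\, d\mu & \text{if } \nu \ll \mu, \\ +\infty & \text{if not,} \end{cases} \qquad (15) \]

where $\bar\Phi(u) := \Phi(u) - \Phi(1)\, u$, and

\[ \mathbf{J}^\Phi(\nu \,|\, \mu) := \begin{cases} \displaystyle -\int_\Omega \Phi'\Big(\frac{d\nu}{d\mu}\Big)\, \mathbf{L}\Big(\frac{d\nu}{d\mu}\Big)\, d\mu & \text{if } \nu \ll \mu, \\ +\infty & \text{if not.} \end{cases} \qquad (16) \]

Observe that $\bar\Phi$ inherits the convexity property from $\Phi$, that $(\bar\Phi(0), \bar\Phi(1)) = (\Phi(0), 0)$ and that
$(\bar\Phi)'' = \Phi''$: one can always “correct” a convex function by an affine additive part in such a way that the
new function vanishes at a fixed point. If $\nu$ is a probability measure with $d\nu := f\, d\mu$, one has

\[ \mathbf{Ent}^\Phi(\nu \,|\, \mu) = \mathbf{Ent}^\Phi_\mu(f) \quad\text{and}\quad \mathbf{J}^\Phi(\nu \,|\, \mu) = \mathbf{J}^\Phi_\mu(f). \]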

It would be interesting to study the role played by such functionals in Large Deviation Principles, since
the “normal” entropy appears as a rate function, cf. [DS89]. The Kullback-Leibler relative entropy, which
corresponds to $\Phi(x) = x \log x$, appears for example in Sanov Theorem as a particular convex conjugate
functional on spaces of probability measures. One can hope for a sort of $\Phi$-Sanov like Theorem involving the
$\Phi$-relative entropy, and the extremal case of the variance is interesting.

Definition 3 ($\Phi$-Sobolev inequalities). In accordance with Corollary 2, a probability measure $\mu$
associated to a Markov process with generator $\mathbf{L}$ satisfies a $\Phi$-Sobolev inequality of constant $c \in \mathbb{R}_+$ on
the class $\mathcal{A} \subset \{f : \Omega \to \mathcal{I}\}$ if and only if

\[ \forall f \in \mathcal{A}, \quad \mathbf{Ent}^\Phi_\mu(f) \leq c\, \mathbf{E}_\mu(\mathcal{E}^\Phi(f)), \qquad (17) \]

where $f \mapsto \mathcal{E}^\Phi(f)$ is a non-negative “energy” functional vanishing when $f$ is constant. The precise choice
of such a functional will depend on the structure of the involved space $\Omega$ and measure $\mu$. When $\mathcal{E}^\Phi(f) =
-\Phi'(f)\, \mathbf{L}f$, the “traditional” logarithmic Sobolev and Poincaré inequalities correspond respectively to
$\Phi(x) = x \log x$ and to $\Phi(x) = x^2$. In abstract settings, a $\Phi$-Sobolev inequality for $\mu$ takes the following form when
$\mathcal{I} = \mathbb{R}_+$:

\[ \forall \nu \ll \mu, \quad \mathbf{Ent}^\Phi(\nu \,|\, \mu) \leq c\, \mathbf{J}^\Phi(\nu \,|\, \mu). \qquad (18) \]

Beware that (17) gives (18) but the converse seems to be true only when $\Phi(x) = x \log x$ (nevertheless, it
is always true for densities $f$ with respect to $\mu$). Here again, one can prefer the name “$\Phi$-Poincaré”
inequalities, but it seems to be an illogical choice since log-Sobolev inequalities are not called “log-Poincaré”!
“Our” $\Phi$-Sobolev inequalities are close in form to the “Orlicz-Poincaré” inequalities mentioned in [GZ03,
p. 125-126]. The interested reader may find an extensive study of similar but quite different inequalities
involving $\Phi(x) = |x|^p$ with $p \geq 1$ in [BZ02] and [BCR04].
Notice that the two classical $\Phi$-entropies in (2) share the same special property: $1/\Phi''$ is affine, i.e.
both convex and concave at the same time. Hence, if one tries to generalise these two cases, one can assume
that $1/\Phi''$ is convex. As we will see, it turns out that this is not a good choice in view of deriving coercive
inequalities like Poincaré or log-Sobolev inequalities, for which the concavity of $1/\Phi''$ is needed. Such a
condition can be found for example in the unrelated works [LO00], [Hu00] and [BR82a]. Nevertheless,
convexity may give inverse forms of such inequalities. Here are some possible additional assumptions on
$\Phi$:

(H1) $(u, v) \mapsto \Phi''(u)\, v^2$ is non-negative and convex on $\mathcal{I} \times \mathbb{R}$;

(H2) $(u, v) \mapsto \Phi(u + v) - \Phi(u) - \Phi'(u)\, v$ is non-negative and convex on $\mathcal{T}_\mathcal{I}$;

(H2’) $\Phi''$ is convex, non-negative and non-increasing on $\mathcal{I}$;

where $\mathcal{T}_\mathcal{I} := \{(u, v) \in \mathbb{R}^2,\ (u, u + v) \in \mathcal{I} \times \mathcal{I}\}$. For convenience, we denote by $\Psi$ the real valued function
defined on $\mathcal{T}_\mathcal{I}$ by

\[ \Psi(u, v) := \Phi(u + v) - \Phi(u) - \Phi'(u)\, v. \qquad (19) \]

Notice that (H1) is equivalent to the convexity of $\Phi$, $\Phi''$ and $-1/\Phi''$, see Remark 29 page 22. On the
other hand, (H2’) implies (H2), and (H2’) is equivalent to stating that $\Phi$, $\Phi''$ and $-\Phi'$ are convex. Basic
examples for (H1), (H2) and (H2’) together are given by

• $\Phi(x) = x \log x$ on $\mathcal{I} = \mathbb{R}_+$;

• $\Phi(x) = x^p$ with $1 < p < 2$ on $\mathcal{I} = \mathbb{R}_+$;

• $\Phi(x) = x^2$ on $\mathcal{I} = \mathbb{R}$.
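The bivariate convexity required by (H1) can be probed numerically for the three examples above. The sketch below uses a randomised midpoint test, which can only refute convexity and never prove it. The helper name `midpoint_convex`, the sampling box and the comparison case $\Phi(x) = x^3$ (for which $1/\Phi''$ is convex rather than concave) are assumptions made for illustration.

```python
import numpy as np

def midpoint_convex(F, rng, trials=20000, tol=1e-9):
    """Return False if some sampled pair violates F((a+b)/2) <= (F(a)+F(b))/2."""
    low, high = [0.1, -3.0], [5.0, 3.0]   # box for (u, v), with u > 0
    a = rng.uniform(low, high, size=(trials, 2))
    b = rng.uniform(low, high, size=(trials, 2))
    mid = 0.5 * (a + b)
    lhs = F(mid[:, 0], mid[:, 1])
    rhs = 0.5 * (F(a[:, 0], a[:, 1]) + F(b[:, 0], b[:, 1]))
    return bool(np.all(lhs <= rhs + tol))

rng = np.random.default_rng(0)

# (u, v) -> Phi''(u) v^2 for each choice of Phi.
h1_xlogx = midpoint_convex(lambda u, v: v ** 2 / u, rng)                # Phi = x log x
h1_power = midpoint_convex(lambda u, v: 0.75 * u ** -0.5 * v ** 2, rng) # Phi = x^{3/2}
h1_square = midpoint_convex(lambda u, v: 2.0 * v ** 2 + 0.0 * u, rng)   # Phi = x^2
h1_cube = midpoint_convex(lambda u, v: 6.0 * u * v ** 2, rng)           # Phi = x^3
```

For $\Phi(x) = x^3$ the Hessian of $(u, v) \mapsto 6uv^2$ has negative determinant whenever $v \neq 0$, so the test is expected to find violations.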

In such examples, the associated $\Phi$-entropy $\mathbf{Ent}^\Phi_\mu$ is homogeneous. Hypothesis (H1) is suitable in continuous
settings, whereas hypotheses (H2) and (H2’) are useful in discontinuous ones. The bivariate convexity
of $(u, v) \mapsto \Phi''(u)\, v^2$ under (H1), and of the function defined in (19) under (H2), is the key property to derive $\Phi$-Sobolev inequalities
in the spirit of (17). Such functional inequalities may be investigated in many situations involving Markov
processes:


Time \ Space     | Continuous space     | Discontinuous space
Continuous time  | Diffusion processes  | Poisson space and Lévy processes
Discrete time    | Discrete Markov chains and martingales (both columns)


Of course, Lévy processes are martingales, but it is not the case for all diffusions. The discrete time case
is not really addressed in this work, but one can consider Random Walks or Bernoulli processes, and
some answers can be found for example in [GP03] and references therein. For each case, one may be
interested in inequalities on the whole paths space or more simply in the law at fixed time (infinite for
the invariant measure by ergodicity). The chain rule formula (10) for the $\Gamma$ operator is lacking in discrete
space settings. However, i.i.d. increments can then help to recover a Brownian Motion like behaviour. In
some cases, inequalities at fixed time can be tensorised to give multi-time inequalities which appear as
inequalities for marginals of the paths space measure. A limiting procedure can then be used to recover
inequalities on paths space (this is the so called cylindrical method). Roughly, we use here two types of
proofs in order to extract $\Phi$-Sobolev inequalities from “Markovianity”. The first kind makes use of the
semi-group and the second kind, more powerful, is based on a martingale representation and gives directly
results on paths spaces. Finally, it is probably possible to establish $\Phi$-Sobolev inequalities under (H1) on
some loop spaces, as is done in [GM98b] and [GM98a] for the logarithmic Sobolev inequality.

The DeBruijn like property stated in Proposition 1 can be seen as a sort of special case of the Gâteaux
directional derivative of $\Phi$-entropy like functionals, see for example [BR82a], where the $\Phi$-entropy is called
J-Divergence. Actually, one can state the following Proposition:

Proposition 4. Assume that $\mu$ is a probability measure and that $\Phi$ fulfils (H1). Let $\mathrm{L}^{1,\Phi}(\mu)$ be the set of
measurable functions $f : \Omega \to \mathcal{I}$ such that $f \in \mathrm{L}^1(\mu)$ and $\Phi(f) \in \mathrm{L}^1(\mu)$. Then $\mathrm{L}^{1,\Phi}(\mu)$ is convex, the
$\Phi$-entropy functional $f \in \mathrm{L}^{1,\Phi}(\mu) \mapsto \mathbf{Ent}^\Phi_\mu(f)$ is convex, and for any $f \in \mathrm{L}^{1,\Phi}(\mu)$, one has the following
duality formula:

\[ \mathbf{Ent}^\Phi_\mu(f) = \sup_{h \in \mathrm{L}^{1,\Phi}(\mu)} \Big\{ \mathbf{E}_\mu\big((\Phi'(h) - \Phi'(\mathbf{E}_\mu h))(f - h)\big) + \mathbf{Ent}^\Phi_\mu(h) \Big\}. \qquad (20) \]
Proof. For any couple $f, g$ in $\mathrm{L}^{1,\Phi}(\mu)$ and any $t \in [0, 1]$, one has

\[ h_t := t f + (1 - t) g \in \mathrm{L}^1(\mu) \]

and $\Phi(h_t) \in \mathrm{L}^1(\mu)$ by convexity of $\Phi$. Thus $h_t \in \mathrm{L}^{1,\Phi}(\mu)$ and $\mathrm{L}^{1,\Phi}(\mu)$ is convex. Consider now the real
function $\alpha : [0, 1] \to \mathbb{R}_+$ defined by $\alpha(t) := \mathbf{Ent}^\Phi_\mu(h_t)$ and let us show that it is convex. Assume first that
$f$ and $g$ are bounded. An easy computation gives

\[ \alpha'(t) = \mathbf{E}_\mu\big(\Phi'(h_t)(f - g)\big) - \Phi'(\mathbf{E}_\mu(h_t))\, \mathbf{E}_\mu(f - g), \]

and

\[ \alpha''(t) = \mathbf{E}_\mu\big(\Phi''(h_t)(f - g)^2\big) - \Phi''(\mathbf{E}_\mu(h_t))\, \big(\mathbf{E}_\mu(f - g)\big)^2, \]

which is non-negative by virtue of Jensen inequality for the bivariate function $(u, v) \mapsto \Phi''(u)\, v^2$.
Alternatively, one can use the concavity of $1/\Phi''$ and Cauchy-Schwarz inequality. Therefore, $\alpha$ is continuous
on $[0, 1]$ and convex on $(0, 1)$. In other words, for any $s, t, \lambda \in [0, 1]$,

\[ \alpha(\lambda s + (1 - \lambda) t) \leq \lambda\, \alpha(s) + (1 - \lambda)\, \alpha(t). \]

At this stage, the dominated convergence Theorem allows us to drop the boundedness assumption on $f$
and $g$. In particular, $\alpha(\lambda) \leq \lambda\, \alpha(1) + (1 - \lambda)\, \alpha(0)$ can be written

\[ \mathbf{Ent}^\Phi_\mu(\lambda f + (1 - \lambda) g) \leq \lambda\, \mathbf{Ent}^\Phi_\mu(f) + (1 - \lambda)\, \mathbf{Ent}^\Phi_\mu(g), \]

and the convexity of $f \in \mathrm{L}^{1,\Phi}(\mu) \mapsto \mathbf{Ent}^\Phi_\mu(f)$ is established. Notice that the convexity of $\alpha$ implies that
the expression of $\alpha'$ is valid for any $f$ and $g$ in $\mathrm{L}^{1,\Phi}(\mu)$. Since every convex function is the envelope of its
tangents, cf. [RW98, HUL01], one gets the following variational formula:

\[ \mathbf{Ent}^\Phi_\mu(f) = \alpha(1) = \sup_{t \in [0, 1]} \big\{ \alpha(t) + \alpha'(t)(1 - t) \big\}, \]

which can be rewritten as (20) since for any fixed $f$ in $\mathrm{L}^{1,\Phi}(\mu)$,

\[ \mathrm{L}^{1,\Phi}(\mu) = \big\{ t f + (1 - t) g, \text{ where } g \in \mathrm{L}^{1,\Phi}(\mu) \text{ and } t \in [0, 1] \big\}. \]

Notice that the value of the sup in (20) is achieved for $h = f$. □

The convexity of $\Phi$-entropy like functionals is well known, see for example [BR82c], [BR82a, Thm.
2] and [LO00, Lem. 4]. One can show that this convexity is in some sense equivalent to (H1) via well
chosen $f$ and $h$ functions in (20). Formally, the Fréchet derivatives of this functional are given by

\[ \big(D\, \mathbf{Ent}^\Phi_\mu\big)(f)(h) = \mathbf{E}_\mu\big([\Phi'(f) - \Phi'(\mathbf{E}_\mu f)]\, h\big), \]

and

\[ \big(D^2\, \mathbf{Ent}^\Phi_\mu\big)(f)(h, h) = \mathbf{E}_\mu\big(\Phi''(f)\, h^2\big) - \Phi''(\mathbf{E}_\mu f)\, (\mathbf{E}_\mu h)^2. \]

We recover the well known formulas for variance and entropy by considering the appropriate $\Phi$:

\[ \mathbf{Var}_\mu(f) = \sup_h \big\{ 2\, \mathbf{Cov}_\mu(f, h) - \mathbf{Var}_\mu(h) \big\}, \qquad (21) \]

and

\[ \mathbf{Ent}_\mu(f) = \sup_h \big\{ \mathbf{E}_\mu(f \log h) - \mathbf{E}_\mu(f) \log \mathbf{E}_\mu(h) \big\}. \qquad (22) \]

The duality formula (20) for $\Phi$-entropies can be found in the Convex Analysis and Information Theory
literature, at least for discrete probability measures, see for example [BTT86]. It follows quite directly
from the convexity of $\Phi$-entropies under (H1). We have to mention that Pascal Massart et al. recently gave
an elementary proof of this variational formula, cf. [BBLM04, Mas03]. See [BR82a] and [MMPS97] for
further developments and a Bayesian and Riemannian Geometry point of view.
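The duality formula (20) lends itself to a quick numerical sanity check on a finite probability space: the bracket evaluated at any admissible $h$ should stay below $\mathbf{Ent}^\Phi_\mu(f)$, with equality at $h = f$. The sketch below takes $\Phi(x) = x \log x$. The four-point measure, the random candidates and the helper names are illustrative assumptions, and a random search only probes the supremum from below.

```python
import numpy as np

def phi_entropy(phi, f, mu):
    """Ent^Phi_mu(f) = E_mu(Phi(f)) - Phi(E_mu f) on a finite probability space."""
    return float(mu @ phi(f) - phi(mu @ f))

def dual_objective(phi, dphi, f, h, mu):
    """Bracket of (20): E_mu((Phi'(h) - Phi'(E_mu h)) (f - h)) + Ent^Phi_mu(h)."""
    mean_h = mu @ h
    linear_part = float(mu @ ((dphi(h) - dphi(mean_h)) * (f - h)))
    return linear_part + phi_entropy(phi, h, mu)

phi = lambda x: x * np.log(x)
dphi = lambda x: np.log(x) + 1.0

rng = np.random.default_rng(1)
mu = np.array([0.1, 0.4, 0.3, 0.2])   # hypothetical probability weights
f = np.array([0.5, 1.0, 3.0, 2.0])    # hypothetical positive function

target = phi_entropy(phi, f, mu)
candidates = rng.uniform(0.1, 5.0, size=(2000, 4))
best_random = max(dual_objective(phi, dphi, f, h, mu) for h in candidates)
at_f = dual_objective(phi, dphi, f, f, mu)
```

For this choice of $\Phi$ the bracket simplifies to $\mathbf{E}_\mu(f \log h) - \mathbf{E}_\mu(f) \log \mathbf{E}_\mu(h)$, which is the expression appearing in (22).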
Remark 5 (Bi-convexity of relative $\Phi$-entropy). Assume that $\mathcal{I} = \mathbb{R}_+$ and let $(\Omega, \mathcal{F})$ be a measurable
space equipped with a positive Borel measure $\mu$. Let $\alpha$ and $\beta$ be two probability measures on $(\Omega, \mathcal{F})$, absolutely
continuous with respect to $\mu$, with densities $p$ and $q$ respectively. Then, the relative $\Phi$-entropy $\mathbf{Ent}^\Phi(\beta \,|\, \alpha)$
defined in (15) is given by

\[ \mathbf{Ent}^\Phi(q; p) := \int_\Omega \bar\Phi\Big(\frac{q}{p}\Big)\, p\, d\mu. \]

An easy computation shows that the convexity of $\bar\Phi$ induces the bi-convexity of the bivariate function
$(u, v) \mapsto \bar\Phi(v/u)\, u$. Thus $(p, q) \mapsto \mathbf{Ent}^\Phi(q; p)$ is also bivariate convex. Notice that at fixed $f$, the
$\Phi$-entropy itself $\mu \mapsto \mathbf{Ent}^\Phi_\mu(f)$ is concave!

Remark 6 (Convexity and nullity for constants). One can be surprised by the condition (H1)
requested for the convexity of the $\Phi$-entropy functional $f \mapsto \mathbf{Ent}^\Phi_\mu(f)$. Actually, it is due to the presence
of the $-\Phi(\mathbf{E}_\mu f)$ term in the definition of $\mathbf{Ent}^\Phi_\mu$. If one removes this term, the resulting functional
$f \mapsto \mathbf{E}_\mu(\Phi(f))$ is always convex since $\Phi$ is convex, and (H1) is not needed. Nevertheless, in that case, the
functional does not vanish when its argument $f$ is constant, and thus it becomes useless as the left hand
side of a Sobolev like functional inequality.

2 Phi-Sobolev for diffusions and log-concave measures 

Proposition 1 translates the fact that the derivative in time of the $\Phi$-entropy along a Markovian
semi-group can be expressed in terms of the derivatives in space $\Gamma$, i.e. Fisher information. As we will
see, for diffusions, a commutation formula between the semi-group and the derivatives in space finally yields
local $\Phi$-Sobolev inequalities. We learnt the following Theorem for hypercontractive diffusions from
[Hu00, Sect. 3], where it is stated in a slightly different manner, see also [Bak02].

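Theorem 7 ($\Phi$-Sobolev inequality for diffusions). Let $(\mathcal{M}, g)$ be a connected complete Riemannian
manifold and let $(X_t)_{t \geq 0}$ be a diffusion process on $\mathcal{M}$ with symmetric invariant positive measure $\mu$
absolutely continuous with respect to the Riemannian volume measure. If a $\mathrm{CD}(\rho, \infty)$ criterion is satisfied
with $\rho \geq 0$ (cf. (27), section 2.1, page 8), then under (H1), one has:

1. For any smooth function $f : \mathcal{M} \to \mathcal{I}$ and any $t \in \mathbb{R}_+$:

\[ \mathbf{Ent}^\Phi_{\mathbf{P}_t}(f) \leq \frac{1 - e^{-2\rho t}}{2\rho}\, \mathbf{P}_t\big(\Phi''(f)\, \Gamma f\big), \qquad (23) \]

where $(\mathbf{P}_t)_{t \geq 0}$ and $\Gamma$ are as in (4) and (5), and where the constant is $t$ when $\rho = 0$.

2. If $\rho > 0$, $\mu$ is a probability measure and $(\mathbf{P}_t)_{t \geq 0}$ is $\mathrm{L}^2$-ergodic, then for any smooth function
$f : \mathcal{M} \to \mathcal{I}$:

\[ \mathbf{Ent}^\Phi_\mu(f) \leq \frac{1}{2\rho}\, \mathbf{E}_\mu\big(\Phi''(f)\, \Gamma f\big). \qquad (24) \]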

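Proof. The second part may be deduced from the first one via ergodicity by letting $t$ tend to $+\infty$. Let
us give a direct proof. Let $\mathbf{L}$ be the infinitesimal generator of the diffusion semi-group. By the ergodic
property and Fubini Theorem, it follows that

\[ \begin{aligned}
\mathbf{Ent}^\Phi_\mu(f) &= \mathbf{E}_\mu(\Phi(f)) - \Phi(\mathbf{E}_\mu f) \\
&= \mathbf{E}_\mu(\Phi(\mathbf{P}_0 f)) - \mathbf{E}_\mu(\Phi(\mathbf{P}_\infty f)) \\
&= -\int_0^\infty \mathbf{E}_\mu\big(\partial_t \Phi(\mathbf{P}_t f)\big)\, dt \\
&= -\int_0^\infty \mathbf{E}_\mu\big(\Phi'(\mathbf{P}_t f)\, \mathbf{L}\mathbf{P}_t f\big)\, dt \\
&= +\int_0^\infty \mathbf{E}_\mu\big(\Phi''(\mathbf{P}_t f)\, \Gamma \mathbf{P}_t f\big)\, dt,
\end{aligned} \]

where the last equality is obtained by the integration by parts (6), which is a consequence of the symmetry of $\mu$
for $\mathbf{L}$, together with the chain rule (10). Now, by the diffusion property, the $\mathrm{CD}(\rho, \infty)$ criterion is equivalent to the following commutation
formula (cf. Section 2.1 page 8):

\[ \sqrt{\Gamma \mathbf{P}_t f} \leq e^{-\rho t}\, \mathbf{P}_t \sqrt{\Gamma f}. \qquad (25) \]

Therefore,

\[ \Phi''(\mathbf{P}_t f)\, \Gamma \mathbf{P}_t f \leq \exp(-2\rho t)\, \Phi''(\mathbf{P}_t f)\, \big(\mathbf{P}_t \sqrt{\Gamma f}\big)^2, \]

and then by Jensen inequality with the bivariate function $(u, v) \mapsto \Phi''(u)\, v^2$, which is convex under (H1):

\[ \Phi''(\mathbf{P}_t(f))\, \big(\mathbf{P}_t(\sqrt{\Gamma f})\big)^2 \leq \mathbf{P}_t\big(\Phi''(f)\, \Gamma f\big). \]

Alternatively, one can use the concavity of $1/\Phi''$ and the Cauchy-Schwarz inequality. The desired result
follows immediately from the invariance of $\mu$, which gives $\mathbf{E}_\mu(\mathbf{P}_t(\Phi''(f)\, \Gamma f)) = \mathbf{E}_\mu(\Phi''(f)\, \Gamma f)$, and from
$\int_0^\infty e^{-2\rho t}\, dt = 1/(2\rho)$. The proof of the first part (23) is quite
similar. One just has to replace $\mu$ by $\mathbf{P}_s$, and $\mathbf{P}_t$ by $\mathbf{P}_{t-s}$. The integration by parts and the chain rule
leading to the expression in $\Gamma$ must be replaced by the following, relying on the diffusion property of $\mathbf{L}$:

\[ \begin{aligned}
\partial_s \mathbf{P}_s\big(\Phi(\mathbf{P}_{t-s} f)\big) &= \mathbf{P}_s\big(\mathbf{L}\Phi(\mathbf{P}_{t-s} f) - \Phi'(\mathbf{P}_{t-s} f)\, \mathbf{L}\mathbf{P}_{t-s} f\big) \\
&= \mathbf{P}_s\big(\Phi''(\mathbf{P}_{t-s} f)\, \Gamma \mathbf{P}_{t-s} f\big),
\end{aligned} \]

where $0 \leq s \leq t$. Notice that the diffusion property is used directly here, which was not the case for the
non local inequality, for which it was used via the strong commutation property given by the $\mathrm{CD}(\rho, \infty)$
criterion. As we will see, the diffusion property is lacking in discontinuous space settings, and one
has to replace $(u, v) \mapsto \Phi''(u)\, v^2$ by the function defined in (19), which is convex under (H2). □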

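See [Hu00] for Gross like hypercontractivity, F.K.G. inequalities and related aspects of $\Phi$-entropies for
diffusion semi-groups. We have to mention that the diffusion property leads to a more general result which
contains F.K.G. and $\Phi$-Sobolev inequalities as sub-cases, as explained in [Hu00, Thm. 4.2]. Namely, it
states that if $\Phi_1, \Phi_2 : \mathcal{I} \subset \mathbb{R} \to \mathbb{R}$ are two smooth functions such that a particular $5 \times 5$ matrix involving
the derivatives of $\Phi_1$ and $\Phi_2$ is positive definite, then for any smooth $f$ and $g$:

\[ \begin{aligned}
\mathbf{Ent}^{\Phi_1, \Phi_2}_\mu(f, g) \leq{} & \mathbf{E}_\mu\big(\Phi_1''(f)\, \Phi_2(g)\, \Gamma f\big) \\
& + \mathbf{E}_\mu\big(\Phi_1(f)\, \Phi_2''(g)\, \Gamma g\big) \\
& + 2\, \mathbf{E}_\mu\big(\Phi_1'(f)\, \Phi_2'(g)\, \Gamma(f, g)\big),
\end{aligned} \]

where the quantity

\[ \mathbf{Ent}^{\Phi_1, \Phi_2}_\mu(f, g) := \mathbf{E}_\mu\big(\Phi_1(f)\, \Phi_2(g)\big) - \Phi_1(\mathbf{E}_\mu(f))\, \Phi_2(\mathbf{E}_\mu(g)) \]

is a sort of $(\Phi_1, \Phi_2)$-covariance. One has in particular

\[ \mathbf{Ent}^{x, x}_\mu = \mathbf{Cov}_\mu \quad\text{and}\quad \mathbf{Ent}^{\Phi, 1}_\mu = \mathbf{Ent}^\Phi_\mu. \]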

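Remark 8 (Non-negativity of functions). It is possible in some cases to reduce the analysis to
non-negative functions. Namely, since $\mathbf{P}_t(|f|) \geq |\mathbf{P}_t(f)|$, one can write:

\[ \begin{aligned}
\mathbf{E}_\mu(f\, \mathbf{L}f) &= \lim_{t \to 0^+} \frac{1}{t}\, \mathbf{E}_\mu\big(f\, \mathbf{P}_t(f) - f^2\big) \\
&\leq \lim_{t \to 0^+} \frac{1}{t}\, \mathbf{E}_\mu\big(|f|\, \mathbf{P}_t(|f|) - |f|^2\big) \\
&= \mathbf{E}_\mu(|f|\, \mathbf{L}|f|),
\end{aligned} \]

which gives $\mathbf{E}_\mu(\Gamma |f|) \leq \mathbf{E}_\mu(\Gamma f)$. More generally, one can show in the same way that $\mathbf{E}_\mu(\Phi''(|f|)\, \Gamma |f|) \leq
\mathbf{E}_\mu(\Phi''(|f|)\, \Gamma f)$. Moreover, one can assume that $f \geq \varepsilon > 0$ by Fatou Lemma.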

2.1 Geometry and log-concavity 

As we have seen in Theorem 7, a natural framework for $\Phi$-Sobolev inequalities related to diffusions is
Riemannian manifolds. Let $\mathbf{L}$ be a Markov generator with symmetric positive measure $d\mu = \exp(U)\, dv_g$
on a complete connected Riemannian manifold $(\mathcal{M}, g)$ equipped with its volume measure $v_g$. The function
$U : \mathcal{M} \to \mathbb{R}$ is taken smooth. A basic example is given by the Laplace-Beltrami operator $\mathbf{L} = \Delta_g + \nabla U$
with vector field $\nabla U$. Back to the generic case of a Markov generator $\mathbf{L}$ on $(\mathcal{M}, g)$, let us define the
iterated functional quadratic forms $\Gamma f := \Gamma(f, f)$ as in (5) and $\Gamma_2 f := \Gamma_2(f, f)$ by

\[ \Gamma_2(f, g) := \frac{1}{2}\big(\mathbf{L}\Gamma(f, g) - \Gamma(f, \mathbf{L}g) - \Gamma(g, \mathbf{L}f)\big). \]
Then, for any $(\rho, n) \in \mathbb{R} \times \mathbb{N}$, the $\mathrm{CD}(\rho, n)$ (or Bakry-Émery $\Gamma_2$) criterion can be expressed as:

\[ \Gamma_2 \geq \rho\, \Gamma + \frac{1}{n}\, (\mathbf{L})^2, \qquad (27) \]

where $(\mathbf{L})^2 f := (\mathbf{L}f)^2$. For diffusion generators $\mathbf{L}$, $\mathrm{CD}(\rho, \infty)$ is equivalent to the strong commutation
formula (25) between $\sqrt{\Gamma}$ and $\mathbf{P}_t$, which is a direct consequence of a Bochner formula^1:

\[ \Gamma_2(f, f) = (\mathrm{Ric}_g - \nabla^2 U)(\nabla f, \nabla f) + \|\mathrm{Hess} f\|_2^2, \]

see for example [Bak97, Bak94], [Led00] and [ABC+00, Chap. 5]. Notice that with $n = \infty$, (27) appears
as the infinitesimal form of (25), via the fact that:

\[ \mathbf{P}_t f = f + t\, \mathbf{L}f + \frac{1}{2}\, t^2\, \mathbf{L}\mathbf{L}f + o(t^2), \]

cf. [ABC+00, Sect. 5.4]. In absence of the diffusion property, the Markov semi-group verifies the following
weaker commutation formula under $\mathrm{CD}(\rho, \infty)$:

\[ \Gamma \mathbf{P}_t f \leq \mathbf{P}_t \Gamma f. \qquad (28) \]

This commutation formula leads to a $\Phi$-Sobolev inequality when $(u, v) \mapsto \Phi''(u)\, v$ is convex on $\mathcal{I} \times \mathbb{R}_+$,
which is the case if and only if $\Phi''' = 0$, i.e. $\Phi(u) = a u^2 + b u + c$ with $(a, b, c) \in \mathbb{R}_+ \times \mathbb{R}^2$ (i.e. Poincaré
inequality).

Let us consider the “simple” example where $(\mathcal{M}, g)$ is the standard flat Euclidean space $(\mathbb{R}^d, \mathrm{I}_d)$ and
where $\mathbf{L} = \Delta - \nabla W \cdot \nabla$. The symmetric invariant measure $\mu$ is then given by $d\mu(x) = \exp(-W(x))\, dx$
and can be normalised as a probability measure if and only if $\exp(-W) \in \mathrm{L}^1(\mathbb{R}^d, \mathbb{R}, dx)$. One
has $\Gamma f = |\nabla f|^2$, $\mathrm{Ric}_g = 0$ and $\Gamma_2 f = \|\nabla^2 f\|_2^2 + \nabla f \cdot \nabla^2 W\, \nabla f$. Now, if there exists a real number
$\rho \geq 0$ such that

\[ \forall x \in \mathbb{R}^d, \quad \nabla^2 W(x) \geq \rho\, \mathrm{I}_d \]

as quadratic forms on $\mathbb{R}^d$, then $\mathbf{L}$ satisfies $\mathrm{CD}(\rho, \infty)$. Moreover, if $\rho > 0$, the measure $\mu$ is uniformly strictly
log-concave and can be normalised as a probability measure. Obviously, the measure $\mu$ can be finite without
being uniformly strictly log-concave.


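Corollary 9 ($\Phi$-Sobolev for uniformly strictly log-concave densities). Let $\mu$ be a probability
measure on $\mathbb{R}^d$ absolutely continuous with respect to Lebesgue measure $dx$ with smooth density. Assume
that $\mu$ is uniformly strictly log-concave, i.e. that there exists $\rho > 0$ such that $d\mu(x) = \exp(-H(x))\, dx$
where $H : \mathbb{R}^d \to \mathbb{R}$ is smooth and $\langle \mathrm{Hess}(H)(x)\, y, y \rangle \geq \rho\, \|y\|_2^2$ for any $x$ and $y$ in $\mathbb{R}^d$. Then under (H1)
and for any smooth function $f : \mathbb{R}^d \to \mathcal{I}$:

\[ \mathbf{Ent}^\Phi_\mu(f) \leq \frac{1}{2\rho}\, \mathbf{E}_\mu\big(\Phi''(f)\, |\nabla f|^2\big). \qquad (29) \]

In particular, when $\mu = \mathcal{N}(m, \Sigma)$ is a Gaussian measure with mean vector $m \in \mathbb{R}^d$ and positive definite
covariance matrix $\Sigma \in \mathrm{Sym}_d^+(\mathbb{R})$, one has

\[ \rho^{-1} = \max(\sigma(\Sigma)) \]

where $\sigma(\Sigma)$ is the spectrum of $\Sigma$. Moreover, the following Brascamp-Lieb type inequality holds:

\[ \mathbf{Ent}^\Phi_{\mathcal{N}(m, \Sigma)}(f) \leq \frac{1}{2}\, \mathbf{E}_{\mathcal{N}(m, \Sigma)}\big(\Phi''(f)\, \langle \Sigma \nabla f; \nabla f \rangle\big), \qquad (30) \]

and inequality (30) remains valid when $\Sigma$ is singular.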
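Inequality (29) can be tested numerically in the simplest case, the standard Gaussian measure on the real line, for which $\rho = 1$. The sketch below uses Gauss-Hermite quadrature from NumPy. The test function $f(x) = 2 + \sin x$, the number of nodes and the helper names are assumptions chosen for illustration, and a single test function only illustrates the inequality.

```python
import numpy as np
from numpy.polynomial.hermite_e import hermegauss

# Quadrature for the weight exp(-x^2/2), normalised to the N(0, 1) probability.
nodes, weights = hermegauss(80)
weights = weights / weights.sum()

def gauss_mean(values):
    """Approximate E(values(X)) for X ~ N(0, 1)."""
    return float(np.dot(weights, values))

f = 2.0 + np.sin(nodes)     # positive test function evaluated at the nodes
df = np.cos(nodes)          # its derivative

# (Phi, Phi'') pairs for the three basic examples satisfying (H1).
examples = {
    "x log x": (lambda x: x * np.log(x), lambda x: 1.0 / x),
    "x^1.5": (lambda x: x ** 1.5, lambda x: 0.75 * x ** -0.5),
    "x^2": (lambda x: x ** 2, lambda x: 2.0 + 0.0 * x),
}

results = {}
for name, (phi, d2phi) in examples.items():
    lhs = gauss_mean(phi(f)) - phi(gauss_mean(f))   # Ent^Phi(f)
    rhs = 0.5 * gauss_mean(d2phi(f) * df ** 2)      # (1 / (2 rho)) E(Phi''(f) |f'|^2)
    results[name] = (lhs, rhs)
```

For $\Phi(x) = x^2$ the left hand side is the variance of $\sin X$, namely $(1 - e^{-2})/2$, and the right hand side is $\mathbf{E}(\cos^2 X) = (1 + e^{-2})/2$, which gives a closed form check of the quadrature.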

Notice that (29) is obtained by the comparison $\mathrm{Hess}(H) \geq \rho\, \mathrm{I}_d$ to the standard quadratic form $\langle \cdot; \cdot \rangle =
\|\cdot\|_2^2$ corresponding to the diagonal covariance matrix $\Sigma$. Such a comparison is essentially unidimensional
and imposes a loss of geometric information. Recall that the Brascamp-Lieb inequality for the strictly
log-concave probability measure $\mu$ on $\mathbb{R}^d$ reads

\[ \forall f : \mathbb{R}^d \to \mathbb{R} \text{ smooth}, \quad \mathbf{Var}_\mu(f) \leq \mathbf{E}_\mu\big(\langle (\mathrm{Hess}(H))^{-1} \nabla f; \nabla f \rangle\big), \]

where $d\mu(x) = \exp(-H(x))\, dx$. When $\mu = \mathcal{N}(m, \Sigma)$ with non singular $\Sigma$, one has $\mathrm{Hess}(H) = \Sigma^{-1}$.
Unfortunately, to our knowledge, and except for Gaussian measures, Brascamp-Lieb forms of $\Phi$-Sobolev
inequalities are available only for Poincaré inequalities, i.e. when $\Phi(u) = u^2$, cf. [BL00] for a counter
example in the logarithmic Sobolev case. However, if $\mu$ satisfies a $\Phi$-Sobolev inequality on $\mathbb{R}^d$ with
constant $c$ and right hand side $\mathbf{E}_\mu(\Phi''(f)\, |\nabla f|^2)$, one can ask about the best deterministic symmetric
matrix $S$ such that

\[ \mathbf{Ent}^\Phi_\mu(f) \leq c\, \mathbf{E}_\mu\big(\Phi''(f)\, \langle S \nabla f; \nabla f \rangle\big). \]

Back to the case where $\mathbf{L} = \Delta - \nabla W \cdot \nabla$ on $\mathbb{R}^d$, an interesting “degenerated” case is given by $W(x) = \|x\|_2^r$
with $r \in \{0\} \cup (1, 2) \cup (2, +\infty)$ since $\nabla^2 W(x)$ is then singular at $x = 0$. One has then only $\mathrm{CD}(0, \infty)$,
inducing a local version only:

\[ \mathbf{Ent}^\Phi_{\mathbf{P}_t}(f) \leq t\, \mathbf{P}_t\big(\Phi''(f)\, |\nabla f|^2\big), \qquad (31) \]

which is the Brownian Motion (or heat semi-group) behaviour corresponding to the case $r = 0$ and
$d\mu(x) = dx$. One can expect a better inequality in terms of the $t$-dependence of the obtained constant, but
the semi-group method seems to fail since it relies on a commutation formula (25) which is poor when
$r \notin \{0, 2\}$. A perturbative approach (cf. Proposition 17 page 15) may be used for the inequalities related
to $\mu$ when $r > 2$, which corresponds to $t = +\infty$, but the constant is not sharp in general (cf. [ABC+00,
Chap. 6]).

^1 Actually Bochner-Lichnerowicz-Weitzenböck.

When $\nabla^2 W$ is constant, $(X_t)_{t \geq 0}$ is a Brownian Motion or an Ornstein-Uhlenbeck process, and the
commutation formula (25) is exact, i.e. an equality. One can try to investigate the commutation between
$f \mapsto \Phi''(f)\, \Gamma f$ and $\mathbf{P}_t$, with a cost related to a sort of convex conjugate of $\Phi$. The idea is to relate $\Phi$
with a criterion involving $\Gamma$ and $\Gamma_2$ (and thus $W$ when $\mathbf{L} = \Delta - \nabla W \cdot \nabla$).

2.2 Interpolation between Poincare and logarithmic Sobolev 

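For simplicity and notational convenience, we restrict ourselves here to $\mathbb{R}^d$. Assume that $\Phi : \mathcal{I} \to \mathbb{R}$
is convex and smooth and that the probability measure $\mu$ on $\mathbb{R}^d$ satisfies a $\Phi$-Sobolev inequality of the
form:

\[ \exists c > 0, \ \forall f \in \mathcal{C}_b^\infty(\mathbb{R}^d, \mathcal{I}), \quad \mathbf{Ent}^\Phi_\mu(f) \leq c\, \mathbf{E}_\mu\big(\Phi''(f)\, |\nabla f|^2\big). \]

Now, for any $g \in \mathcal{C}_b^\infty(\mathbb{R}^d, \mathbb{R})$, let us consider $a$ in the interior of $\mathcal{I}$ such that $\Phi''(a) > 0$ and $f_\varepsilon := a + \varepsilon g$ with $\varepsilon > 0$.
Then, $f_\varepsilon \in \mathcal{C}_b^\infty(\mathbb{R}^d, \mathcal{I})$ for $\varepsilon$ sufficiently small. By Taylor formula for $\Phi$, the left hand side is
$\frac{1}{2} \Phi''(a)\, \varepsilon^2\, \mathbf{Var}_\mu(g) + o(\varepsilon^2)$ and the right hand side is $c\, \Phi''(a)\, \varepsilon^2\, \mathbf{E}_\mu(|\nabla g|^2) + o(\varepsilon^2)$, so the $\Phi$-Sobolev inequality
above for $f_\varepsilon$ gives, when $\varepsilon$ goes to $0^+$:

\[ \mathbf{Var}_\mu(g) \leq 2c\, \mathbf{E}_\mu\big(|\nabla g|^2\big). \]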
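The Taylor expansion behind this linearisation can be checked numerically: on a finite probability space, $\mathbf{Ent}^\Phi_\mu(a + \varepsilon g) / \varepsilon^2$ should approach $\frac{1}{2} \Phi''(a)\, \mathbf{Var}_\mu(g)$ as $\varepsilon$ goes to $0^+$. The sketch below takes $\Phi(x) = x \log x$. The three-point measure, the function `g` and the base point `a` are assumptions made for illustration.

```python
import numpy as np

def phi_entropy(phi, f, mu):
    """Ent^Phi_mu(f) = E_mu(Phi(f)) - Phi(E_mu f) on a finite probability space."""
    return float(mu @ phi(f) - phi(mu @ f))

phi = lambda x: x * np.log(x)
d2phi = lambda x: 1.0 / x

mu = np.array([0.25, 0.25, 0.5])   # hypothetical probability weights
g = np.array([-1.0, 2.0, 0.5])     # hypothetical bounded function
a = 2.0                            # base point with Phi''(a) > 0

var_g = float(mu @ (g - mu @ g) ** 2)
limit = 0.5 * d2phi(a) * var_g

# Rescaled Phi-entropies of the perturbed constant a + eps * g.
ratios = {eps: phi_entropy(phi, a + eps * g, mu) / eps ** 2
          for eps in (1e-1, 1e-2, 1e-3)}
```

Only the leading term of the left hand side is illustrated here; the gradient side of the inequality has no direct counterpart on a finite set without choosing a generator.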

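Therefore, Poincaré inequality can be seen in a sense as the weakest $\Phi$-Sobolev inequality since it is
implied by any $\Phi$-Sobolev inequality, at least on $\mathcal{C}_b^\infty(\mathbb{R}^d, \mathcal{I})$. Moreover, let us assume that $\mathbb{R}_+ \subset \mathcal{I}$ and
let us take now $f_\varepsilon = a + \varepsilon\, g^{p/2}$ where $g$ is positive and $p \in (1, 2]$, then we get in the same way:

\[ \mathbf{E}_\mu(g^p) - \mathbf{E}_\mu(g^{p/2})^2 \leq 2c\, \frac{p^2}{4}\, \mathbf{E}_\mu\big(g^{p-2}\, |\nabla g|^2\big). \]

Now, since $p \in (1, 2]$, Jensen inequality yields $\mathbf{E}_\mu(g^{p/2})^2 \leq \mathbf{E}_\mu(g)^p$, and therefore:

\[ \mathbf{E}_\mu(g^p) - \mathbf{E}_\mu(g)^p \leq 2c\, \frac{p}{4(p - 1)}\, \mathbf{E}_\mu\big(p(p - 1)\, g^{p-2}\, |\nabla g|^2\big), \]

which is a $\Phi$-Sobolev inequality with $\Phi(x) = x^p$ on $\mathcal{I} = \mathbb{R}_+$. However, the constant does not give the
sharp one obtained in Corollary 9 when $\mu$ is log-concave, and there is no way to recover a logarithmic
Sobolev inequality by letting $p$ tend to $1^+$ like in (35) since $2cp/(4(p - 1))$ blows up near $p = 1$. This
fact is not surprising since one can start from a Poincaré inequality in the latter, which is known to
be strictly weaker than the logarithmic Sobolev inequality (the simplest counter-example is given by the
exponential probability law on the real line, see also [ABC+00, Chap. 6]). Recall that the $\Phi$-Sobolev
inequality obtained in Corollary 9 for log-concave probability measures includes the optimal Poincaré and
logarithmic Sobolev inequalities by considering the appropriate $\Phi$, namely:

\[ \forall f \in \mathrm{W}^{1,\infty}(\mathbb{R}^d, \mathbb{R}_+^*), \quad \mathbf{Ent}_\mu(f) \leq \frac{1}{2\rho}\, \mathbf{E}_\mu\Big(\frac{|\nabla f|^2}{f}\Big), \qquad (32) \]

and

\[ \forall f \in \mathrm{W}^{1,\infty}(\mathbb{R}^d, \mathbb{R}), \quad \mathbf{Var}_\mu(f) \leq \frac{1}{\rho}\, \mathbf{E}_\mu\big(|\nabla f|^2\big), \qquad (33) \]

and more generally, for any $p \in (1, 2]$:

\[ \forall f \in \mathrm{W}^{1,\infty}(\mathbb{R}^d, \mathbb{R}_+^*), \quad \mathbf{Ent}^{x^p}_\mu(f) \leq \frac{1}{2\rho}\, \mathbf{E}_\mu\big(p(p - 1)\, f^{p-2}\, |\nabla f|^2\big). \qquad (34) \]

Recall that $\mathbf{Ent}_\mu = \mathbf{Ent}^{x \log x}_\mu$ and that $\mathbf{Var}_\mu := \mathbf{Ent}^{x^2}_\mu$. The $\mathrm{W}^{1,\infty}$-regularity is not optimal, but
we are not interested here in such aspects. Notice that $f^{p-2}\, |\nabla f|^2 = \frac{4}{p^2}\, |\nabla f^{p/2}|^2$. The particular
constant in $p$ appearing in (34) allows one to derive the logarithmic Sobolev inequality by using the fact that
for a positive $f$:

\[ \lim_{p \to 1^+} \frac{\mathbf{E}_\mu(f^p) - \mathbf{E}_\mu(f)^p}{p - 1} = \mathbf{Ent}_\mu(f). \qquad (35) \]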
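The limit (35) is simply the derivative in $p$ of $\mathbf{E}_\mu(f^p) - \mathbf{E}_\mu(f)^p$ at $p = 1$, and it can be observed numerically on a finite probability space. In the sketch below, the measure `mu`, the function `f` and the helper name `power_ratio` are assumptions chosen for illustration.

```python
import numpy as np

mu = np.array([0.2, 0.5, 0.3])   # hypothetical probability weights
f = np.array([1.0, 2.0, 4.0])    # hypothetical positive function

def power_ratio(p):
    """(E_mu(f^p) - E_mu(f)^p) / (p - 1), the quotient appearing in (35)."""
    return float((mu @ f ** p - (mu @ f) ** p) / (p - 1.0))

mean = mu @ f
entropy = float(mu @ (f * np.log(f)) - mean * np.log(mean))

# Quotients for p = 1.1, 1.01, ..., 1.000001 and their distance to Ent_mu(f).
ratios = [power_ratio(1.0 + 10.0 ** (-k)) for k in range(1, 7)]
errors = [abs(r - entropy) for r in ratios]
```

The error is of order $p - 1$, so each extra decimal in $p$ gains roughly one decimal of accuracy until floating point cancellation takes over.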

Therefore, the logarithmic Sobolev inequality appears as a sort of limiting case of a family of ^-Sobolev 
inequalities associated to $(a;) = with p close to 1+, and the constant is sharp when p is log-concave. 
In another direction, by an appropriate change of function, one can reformulate the p-inequality (34) as 
follows: 

V/ G w1’“(r^r;), e^(/2) - e^(/2/p)'’ < ^T^e^(|v/|') , 

which gives by denoting q = 2/p, i.e. q G [1,2): 

V/ G w1’“(r^r;), E^(/2) - E^(/«)"/« ^ E^(|V/|"). 

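By removing the positivity condition on $f$ as explained in Remark 8, we finally get that for any $q \in [1,2)$:

$$\forall f \in W^{1,\infty}(\mathbb{R}^d, \mathbb{R}), \quad \mathbf{E}_\mu(f^2) - \mathbf{E}_\mu(|f|^q)^{2/q} \le \frac{2-q}{\rho}\, \mathbf{E}_\mu\!\left(|\nabla f|^2\right), \tag{36}$$

which is exactly the inequality studied in [Bec89] for the Gaussian measure and in [LO00], [Wan02],
[Bar01], [BR03] and [BCR04] for further developments related to this particular case. Notice that in
(36), the energy term $\mathbf{E}_\mu(|\nabla f|^2)$ does not depend on $q$, and one can adopt the following more convenient
formulation:

$$\forall f \in W^{1,\infty}(\mathbb{R}^d, \mathbb{R}), \quad \sup_{q \in [1,2)} \frac{\mathbf{E}_\mu(f^2) - \mathbf{E}_\mu(|f|^q)^{2/q}}{2 - q} \le \frac{1}{\rho}\, \mathbf{E}_\mu\!\left(|\nabla f|^2\right). \tag{37}$$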

Generalisations of this type of statement where $(2 - q)$ and $1/\rho$ are replaced by more general functions
are addressed in [LO00] and [Wan02]. Actually, inequality (36) appears as a sort of infinite dimensional
"dual" version of a Sobolev inequality. Namely, let $(\mathcal{M}, g)$ be a smooth compact connected Riemannian
manifold with dimension $d \ge 3$ and Ricci curvature bounded below by $\rho > 0$, and let $\mu$ be the normalised Riemannian
volume probability measure. The Sobolev inequality states that for any real valued smooth function $f$
on $\mathcal{M}$:

$$\mathbf{E}_\mu(|f|^q)^{2/q} - \mathbf{E}_\mu(f^2) \le \frac{(q-2)(d-1)}{d\,\rho}\, \mathbf{E}_\mu\!\left(|\nabla f|^2\right), \tag{38}$$

where $q := 2d/(d-2)$ and $|\nabla f|^2$ denotes the squared length of the gradient of $f$ on $\mathcal{M}$, see for example [Led00].
Such an inequality is stronger than the logarithmic Sobolev inequality, which is stronger than the Poincaré
inequality. When $\mathcal{M}$ is the standard $d$-dimensional sphere $S^d(r) \subset \mathbb{R}^{d+1}$ of radius $r = \sqrt{d}$, one has

$$\rho = \frac{d-1}{d} = \frac{q+2}{2q}.$$

It is then well known that one can recover the optimal logarithmic Sobolev inequality for the standard
Gaussian measure $\mathcal{N}(0, I_k)$ on $\mathbb{R}^k$ by taking the projection $\pi_k : \mathbb{R}^{d+1} \to \mathbb{R}^k$ where $k \le d$ and by
letting $d$ tend to $+\infty$ (i.e. $q$ tends to 2). This fact is sometimes referred to as the "Poincaré observation",
see for example [Joh03] for a generalisation to the case where $\mu$ is a Boltzmann-Gibbs product measure.
Remark 10 (From $\Phi$-Poincaré to $\Phi$-Sobolev). It is shown in [Wan02] that for any $p \in [1,2]$ and
any $f \in L^2(\Omega, \mathcal{F}, \mu)$, where $\mu$ is a probability measure on $\Omega$:

E^(/2) - E^(|/r)"/" ^ Var^/) + (1 - p) Var^^(/)^/^ . 

It appears as an extension of a Rothaus-Deuschel-Stroock-Bakry inequality to the case $p \in [1,2]$, cf.
[ABC+00, Lemmas 4.3.7 and 4.3.8]. In the presence of a Poincaré inequality, it can perhaps be used to deduce
a ^-Sobolev inequality for <l>(a;) = from an hypothetic ^-Poincare inequality involving the ^-variance 
(3). 

Remark 11 (Sets convexity). For any $\alpha \in \{1, 2, 2'\}$, let $\mathbf{E}(\alpha, \mathcal{I})$ be the set of smooth convex functions
from $\mathcal{I}$ to $\mathbb{R}$ such that $(\mathcal{H}\alpha)$ holds. It is easy to see that these three sets are convex cones, i.e. stable
by linear combinations with non-negative scalar coefficients. For example, for any $\lambda := (\lambda_1, \lambda_2, \lambda_3, \lambda_4) \in
(\mathbb{R}_+)^2 \times \mathbb{R}^2$ and any $p \in (1, 2]$, the function $\Phi_{p,\lambda} : \mathbb{R}_+ \to \mathbb{R}$ defined by

$$\Phi_{p,\lambda}(x) := \lambda_1 x^p + \lambda_2\, x \log x + \lambda_3 x + \lambda_4$$

is in $\mathbf{E}(1, \mathbb{R}_+) \cap \mathbf{E}(2, \mathbb{R}_+) \cap \mathbf{E}(2', \mathbb{R}_+)$. It is clear that the functions $\Phi_{2,e_1}$ and $\Phi_{p,e_2}$ are extremal points of the convex
cone $\mathbf{E}(1, \mathbb{R}_+)$. One can observe that for any probability measure $\mu$ and any interval $\mathcal{I}$ of $\mathbb{R}$, the
functional-valued map $\Phi \in \mathbf{E}(\alpha, \mathcal{I}) \mapsto \mathbf{Ent}^\Phi_\mu$ is the restriction of a linear map over the cone $\mathbf{E}(\alpha, \mathcal{I})$, which
is not sensitive to $\lambda_3$ and $\lambda_4$. The same linearity property holds for the associated $\Phi$-Sobolev inequalities,
which are moreover invariant by any dilatation in $\lambda$. Thus, we understand that in view of $\Phi$-Sobolev
inequalities, $\lambda_3$ and $\lambda_4$ play no role and that $\Phi_{2,e_1}$ and $\Phi_{p,e_2}$, which correspond respectively to the classical
variance and entropy, are "extremal" cases under $(\mathcal{H}1)$. It could be nice to try to use a sort of Choquet
integral representation by means of extremal points, cf. [FLP01].
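The linearity of $\Phi \mapsto \mathbf{Ent}^\Phi_\mu$ and its insensitivity to the affine part of $\Phi$ are easy to observe numerically. The sketch below is only an illustration on a hypothetical four-point probability space; the helper names `phi_entropy` and `phi_family` and all numerical values are assumptions made for the example.

```python
import math

def phi_entropy(phi, weights, values):
    """Ent^Phi(f) = E(Phi(f)) - Phi(E(f)) on a finite probability space."""
    mean = sum(w * v for w, v in zip(weights, values))
    return sum(w * phi(v) for w, v in zip(weights, values)) - phi(mean)

def phi_family(p, lam):
    """x -> l1 x^p + l2 x log x + l3 x + l4 on (0, +inf)."""
    l1, l2, l3, l4 = lam
    return lambda x: l1 * x ** p + l2 * x * math.log(x) + l3 * x + l4

# Hypothetical data: probability weights, a positive function, and coefficients.
weights = [0.1, 0.4, 0.3, 0.2]
values = [0.5, 1.0, 2.0, 4.0]
p = 1.5
lam = (2.0, 3.0, -7.0, 11.0)

full = phi_entropy(phi_family(p, lam), weights, values)
power_part = phi_entropy(lambda x: x ** p, weights, values)
log_part = phi_entropy(lambda x: x * math.log(x), weights, values)
# Only the first two coefficients contribute: the affine part has zero Phi-entropy.
combined = lam[0] * power_part + lam[1] * log_part
affine_part = phi_entropy(lambda x: lam[2] * x + lam[3], weights, values)
```

Here `full` and `combined` agree up to rounding, and `affine_part` vanishes up to rounding.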

3 Perturbation, tensorisation, convolution, concentration 

We explore here the stability of $\Phi$-Sobolev inequalities by tensorisation, convolution, and perturbation.
We also give some concentration of measure consequences.

3.1 Tensorisation 

A sub-additivity property for $\Phi$-entropies under $(\mathcal{H}1)$ can be found in [LO00, Cor. 3]. However,
we provide in the sequel a short and simple proof relying on the convexity of the $\Phi$-entropy functional
established in Proposition 4. Duality and sub-additivity formulas are well known for variance and entropy,
cf. [Led97] and [ABC+00, Chap. 1]. The tensorisation property for $\Phi$-entropies can be seen as an
extension of the sub-additivity of the Kullback-Leibler relative entropy. Beware that it is different from the
also well known sub-additivity of Shannon like entropies, cf. Section 6 page 23 for an explanation.

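Recall that for two probability measures $\nu$ and $\mu$ on a measurable space $(\Omega, \mathcal{F})$ with $\nu \ll \mu$, the
Kullback-Leibler relative entropy $\mathbf{Ent}(\nu \,|\, \mu)$ is defined by:

$$\mathbf{Ent}(\nu \,|\, \mu) := \int \frac{d\nu}{d\mu} \log \frac{d\nu}{d\mu}\, d\mu = \int \log \frac{d\nu}{d\mu}\, d\nu = \mathbf{Ent}_\mu\!\left(\frac{d\nu}{d\mu}\right).$$

Let $(\Omega_1 \times \cdots \times \Omega_n, \mathcal{F}_1 \otimes \cdots \otimes \mathcal{F}_n, \mu_1 \otimes \cdots \otimes \mu_n)$ be a product probability space, and write $\mu := \mu_1 \otimes \cdots \otimes \mu_n$. Then, for any probability
measure $\nu \ll \mu_1 \otimes \cdots \otimes \mu_n$:

$$\mathbf{Ent}(\nu \,|\, \mu_1 \otimes \cdots \otimes \mu_n) \le \mathbf{E}_\mu(\mathbf{Ent}(\nu \,|\, \mu_1)) + \cdots + \mathbf{E}_\mu(\mathbf{Ent}(\nu \,|\, \mu_n)), \tag{39}$$

where

$$\mathbf{Ent}(\nu \,|\, \mu_i) := \mathbf{Ent}_{\mu_i}(f) = \int_{\Omega_i} f \log f \, d\mu_i - \int_{\Omega_i} f \, d\mu_i \, \log \int_{\Omega_i} f \, d\mu_i,$$

and $f := d\nu / d(\mu_1 \otimes \cdots \otimes \mu_n)$. Moreover, equality holds if and only if $\nu = \nu_1 \otimes \cdots \otimes \nu_n$. In other words, for
any real valued integrable function $f$ on the product space:

$$\mathbf{Ent}_{\mu_1 \otimes \cdots \otimes \mu_n}(f) \le \mathbf{E}_\mu(\mathbf{Ent}_{\mu_1}(f)) + \cdots + \mathbf{E}_\mu(\mathbf{Ent}_{\mu_n}(f)). \tag{40}$$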

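This property of relative entropy remains true for $\Phi$-entropies when $\Phi$ fulfils $(\mathcal{H}1)$:

Proposition 12 (Tensorisation of $\Phi$-entropies). Suppose that $\Phi$ satisfies $(\mathcal{H}1)$. Let $(\Omega_1 \times \cdots \times
\Omega_n, \mathcal{F}_1 \otimes \cdots \otimes \mathcal{F}_n, \mu_1 \otimes \cdots \otimes \mu_n)$ be a product probability space, and write $\mu := \mu_1 \otimes \cdots \otimes \mu_n$. Then, for any measurable function
$f : (\Omega, \mathcal{F}) \to (\mathcal{I}, \mathcal{B}(\mathcal{I}))$ one has:

$$\mathbf{Ent}^\Phi_{\mu_1 \otimes \cdots \otimes \mu_n}(f) \le \mathbf{E}_\mu\!\left(\mathbf{Ent}^\Phi_{\mu_1}(f)\right) + \cdots + \mathbf{E}_\mu\!\left(\mathbf{Ent}^\Phi_{\mu_n}(f)\right). \tag{41}$$

Proof. As for variance and entropy, (41) is a consequence of the convexity of the $\Phi$-entropy functional,
and can be obtained by using the variational (duality) formula for $\Phi$-entropies (20). Namely, for any $h$
and $i \in \{0, \ldots, n\}$, let $h_i := \mathbf{E}_{\mu_1 \otimes \cdots \otimes \mu_i}(h)$, with $h_0 = h$. Thanks to the variational formula (20), one has: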

Ent;^(/) < E^(($'(/ro) - <^'{h^)){f - h)) - Ent‘^^{h). 


Since we can write

$$\Phi'(h_0) - \Phi'(h_n) = \sum_{i=1}^{n} \left(\Phi'(h_{i-1}) - \Phi'(h_i)\right),$$

the desired result follows by the variational formula (20) again and the fact that

Ent;^(/r) ^ E^(Ent;^^(/i,_i)) , 

which is due to Jensen inequality for the convex function $. □ 

The tensorisation property for $\Phi$-entropies can be used to tensorise $\Phi$-Sobolev inequalities, as is
done for logarithmic Sobolev and Poincaré inequalities, which can then be viewed as particular cases. In
essence, $\Phi$-Sobolev inequalities under $(\mathcal{H}1)$ are then infinite dimensional, since they hold on the product
space with the maximum of the one dimensional constants.
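The tensorisation inequality can be observed numerically on a product of two finite probability spaces. The following sketch is an illustration only: the measures `mu1`, `mu2`, the functions on the grid and the helper names are assumptions. It computes the gap between the two sides for $\Phi(x) = x^2$ and $\Phi(x) = x \log x$, and checks that a product function gives equality in the entropy case.

```python
import math

def phi_entropy(phi, weights, values):
    """Ent^Phi(f) = E(Phi(f)) - Phi(E(f)) on a finite probability space."""
    mean = sum(w * v for w, v in zip(weights, values))
    return sum(w * phi(v) for w, v in zip(weights, values)) - phi(mean)

def tensorisation_gap(phi, mu1, mu2, f):
    """Right-hand side minus left-hand side of the tensorisation inequality
    for a function f[i][j] on the product of two finite probability spaces."""
    n1, n2 = len(mu1), len(mu2)
    joint_w = [mu1[i] * mu2[j] for i in range(n1) for j in range(n2)]
    joint_v = [f[i][j] for i in range(n1) for j in range(n2)]
    lhs = phi_entropy(phi, joint_w, joint_v)
    # Entropy in the first coordinate, averaged over the second one.
    first = sum(mu2[j] * phi_entropy(phi, mu1, [f[i][j] for i in range(n1)])
                for j in range(n2))
    # Entropy in the second coordinate, averaged over the first one.
    second = sum(mu1[i] * phi_entropy(phi, mu2, f[i]) for i in range(n1))
    return first + second - lhs

def square(x):
    return x * x

def xlogx(x):
    return x * math.log(x)

# Hypothetical data.
mu1 = [0.3, 0.7]
mu2 = [0.2, 0.5, 0.3]
f = [[1.0, 2.0, 0.5], [3.0, 0.7, 1.5]]
a, b = [1.0, 2.0], [0.5, 1.0, 3.0]
f_product = [[ai * bj for bj in b] for ai in a]

gap_variance = tensorisation_gap(square, mu1, mu2, f)
gap_entropy = tensorisation_gap(xlogx, mu1, mu2, f)
gap_entropy_product = tensorisation_gap(xlogx, mu1, mu2, f_product)
```

Both gaps are non-negative, and the gap for the product function vanishes up to rounding.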

3.2 Convolution 

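For any $x \in \mathbb{R}^d$, we denote by $\tau_x$ the translation of vector $x$ in $\mathbb{R}^d$, acting on any function $f : \mathbb{R}^d \to \mathbb{R}$
by $(\tau_x \cdot f)(y) := f(x + y)$. One can then state the following immediate Corollary of Proposition 12.

Corollary 13 (Stability of $\Phi$-Sobolev inequalities by convolution). Suppose that $\Phi$ satisfies $(\mathcal{H}1)$.
For any $i \in \{1, \ldots, n\}$, let $\mu_i$ be a probability measure on $\mathbb{R}^d$ satisfying a $\Phi$-Sobolev inequality of the
form

$$\exists c_i > 0, \ \forall f : \mathbb{R}^d \to \mathcal{I} \text{ smooth}, \quad \mathbf{Ent}^\Phi_{\mu_i}(f) \le c_i\, \mathbf{E}_{\mu_i}\!\left(\mathcal{E}^\Phi(f)\right),$$

where $f \mapsto \mathcal{E}^\Phi(f)$ is a non-negative functional depending on $\Phi$ and its derivatives such that $\mathcal{E}^\Phi \circ \tau_x = \tau_x \circ \mathcal{E}^\Phi$
for any $x \in \mathbb{R}^d$. Then, one has for any smooth function $f : \mathbb{R}^d \to \mathcal{I}$:

$$\mathbf{Ent}^\Phi_{\mu_1 * \cdots * \mu_n}(f) \le (c_1 + \cdots + c_n)\, \mathbf{E}_{\mu_1 * \cdots * \mu_n}\!\left(\mathcal{E}^\Phi(f)\right). \tag{42}$$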

The commutation property $\mathcal{E}^\Phi \circ \tau_x = \tau_x \circ \mathcal{E}^\Phi$ is satisfied for example when one has $\mathcal{E}^\Phi(f) = \Phi''(f)\, |\nabla f|^2$,
and one can then check the optimality of the obtained constant $c_1 + \cdots + c_n$ when the $\mu_i$ are Gaussian measures.
The Poisson measures with suitable energy $\mathcal{E}^\Phi$ give another example for which the constant is optimal, cf.
Section 5. We believe that Corollary 13 remains essentially the same if one replaces the Euclidean space
$\mathbb{R}^d$ by an infinite dimensional Banach space or by a topological Abelian group (and maybe any Lie
group). It can be useful for processes with independent increments, in discrete or continuous time, and
for infinitely divisible laws. However, we will use different methods in the sequel, since i.i.d. increments
and/or infinite divisibility can be used in other ways to provide the same result.

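Remark 14 (Invariance by action of the translation group). Notice that the hypothesis $\mathcal{E}^\Phi \circ \tau_x = \tau_x \circ \mathcal{E}^\Phi$
for any $x \in \mathbb{R}^d$ ensures that any associated $\Phi$-Sobolev inequality is "invariant by translations in base
space". Namely, if one has

$$\exists c > 0, \ \forall f \in \mathcal{A}, \quad \mathbf{Ent}^\Phi_\mu(f) \le c\, \mathbf{E}_\mu\!\left(\mathcal{E}^\Phi(f)\right),$$

then for any $x \in \mathbb{R}^d$,

$$\forall f \in \tau_{-x} \cdot \mathcal{A}, \quad \mathbf{Ent}^\Phi_{\tau_x \cdot \mu}(f) \le c\, \mathbf{E}_{\tau_x \cdot \mu}\!\left(\mathcal{E}^\Phi(f)\right),$$

where the action of $\tau_x$ on the measure $\mu$ is given by $\tau_x \cdot \mu := \mu * \delta_x$. Thus, if the class of functions $\mathcal{A}$ is
invariant by translations, i.e. $\tau_x \cdot \mathcal{A} = \mathcal{A}$ for any $x \in \mathbb{R}^d$, then any element of the orbit $\{\mu * \delta_x, \ x \in \mathbb{R}^d\}$
satisfies the $\Phi$-Sobolev inequality satisfied by $\mu$, with the same constant $c$ and class $\mathcal{A}$. Here again, we
believe that things remain essentially the same if one replaces $\mathbb{R}^d$ by an infinite dimensional Banach space
or by a topological Abelian group. Notice that when $\Phi(x)$ is of the form $x \log x$ or $x^p$ with $p \in (1, 2]$, the
associated $\Phi$-Sobolev inequality on $\mathbb{R}^d$ with $\mathcal{E}^\Phi(f) = \Phi''(f)\, |\nabla f|^2$ is homogeneous, and thus is additionally
stable by dilatations.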
Actually, translations are particular examples of Lipschitz functions and one can expect a sort of
stability by Lipschitz transforms when the energy functional $\mathcal{E}^\Phi$ satisfies some stability. The following
remark gives an answer.

Remark 15 (Invariance by action of Lipschitz functions). Let $\Theta : \mathbb{R}^d \to \mathbb{R}^{d'}$ be a measurable map,
acting on a function $f : \mathbb{R}^{d'} \to \mathbb{R}$ by $(\Theta \cdot f)(y) := f(\Theta(y))$ for any $y \in \mathbb{R}^d$. Assume that the probability
measure $\mu$ on $\mathbb{R}^d$ satisfies the following $\Phi$-Sobolev inequality:

$$\exists c > 0, \ \forall g \in \mathcal{A}, \quad \mathbf{Ent}^\Phi_\mu(g) \le c\, \mathbf{E}_\mu\!\left(\mathcal{E}^\Phi(g)\right),$$

where $\mathcal{E}^\Phi$ is a non-negative functional. Assume that there exists a constant $\alpha > 0$ such that for any
$g \in \mathcal{A}$, $(\mathcal{E}^\Phi \circ \Theta)(g) \le \alpha\, (\Theta \circ \mathcal{E}^\Phi)(g)$. Let us denote by $\mathcal{A} \cdot \Theta$ the set of functions $f : \mathbb{R}^{d'} \to \mathcal{I}$ such that
$\Theta \cdot f \in \mathcal{A}$. Then, one has

$$\forall f \in \mathcal{A} \cdot \Theta, \quad \mathbf{Ent}^\Phi_{\Theta \cdot \mu}(f) \le c\, \alpha\, \mathbf{E}_{\Theta \cdot \mu}\!\left(\mathcal{E}^\Phi(f)\right),$$

where $\Theta \cdot \mu := \mu \circ \Theta^{-1}$ is the image measure of $\mu$ by $\Theta$, i.e. the law of $\Theta$ under $\mu$. An important example
is given by $\mathcal{E}^\Phi(g) = \Phi''(g)\, |\nabla g|^2$ and by any smooth map $\Theta$ such that $\|\mathrm{Jac}(\Theta)\|_2 \le \sqrt{\alpha}$. In particular,
when $d = d'$, such $\Phi$-Sobolev inequalities are stable up to constants by the action of the non-Abelian group
of diffeomorphisms with "bounded Jacobian". Actually, in many cases including $\mathcal{E}^\Phi(g) = \Phi''(g)\, |\nabla g|^2$,
only a weak smoothness of $\Theta$ is needed, and for example one can assume only that $\Theta$ is Lipschitz when
$d' = 1$.

As an immediate consequence of Corollary 13 and Remarks 14 and 15, one can deduce the following 
result. 

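Corollary 16. For any $i \in \{1, \ldots, n\}$, let $\mu_i$ be a probability measure on $\mathbb{R}^{d_i}$ satisfying a $\Phi$-Sobolev
inequality of the form:

$$\exists c_i > 0, \ \forall f : \mathbb{R}^{d_i} \to \mathcal{I} \text{ smooth}, \quad \mathbf{Ent}^\Phi_{\mu_i}(f) \le c_i\, \mathbf{E}_{\mu_i}\!\left(\Phi''(f)\, |\nabla f|^2\right),$$

where $\Phi$ satisfies $(\mathcal{H}1)$. Let $d$ be in $\mathbb{N}^*$. For each $i = 1, \ldots, n$, let $\Theta_i : \mathbb{R}^{d_i} \to \mathbb{R}^d$ be a smooth (resp.
Lipschitz when $d = 1$) map with $\|\mathrm{Jac}(\Theta_i)\|_2 \le \sqrt{\alpha_i}$ (resp. $\|\Theta_i\|_{\mathrm{Lip}} \le \sqrt{\alpha_i}$ when $d = 1$). Let $\mu$ be the
probability measure on $\mathbb{R}^d$ defined by

$$\mu := (\Theta_1 \cdot \mu_1) * \cdots * (\Theta_n \cdot \mu_n). \tag{43}$$

Then, for any smooth function $f : \mathbb{R}^d \to \mathcal{I}$:

$$\mathbf{Ent}^\Phi_\mu(f) \le (c_1 \alpha_1 + \cdots + c_n \alpha_n)\, \mathbf{E}_\mu\!\left(\Phi''(f)\, |\nabla f|^2\right). \tag{44}$$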

Probability measures like in (43) appear for example in Statistics as fixed points of some kernel
estimators. A very special Gaussian case of (43) was studied in [BK02] when $\Phi(x) = x \log x$ and
$(n, d_1, d) = (1, 2, 1)$. In view of some applications to Statistics and Statistical Mechanics, the natural
next step is to explore the stability of $\Phi$-Sobolev inequalities by general mixtures of the form

E^(/) :=^E^,(/) 

and conditioning of the form $\mu = \mathcal{L}((X_1, \ldots, X_n) \mid F(X_1, \ldots, X_n) = m)$ for example. But such aspects
are far more delicate and intricate, and will hopefully be the subject of forthcoming papers.

3.3 Perturbation 

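One can derive a perturbation property for the $\Phi$-Sobolev inequality via the following straightforward
variational formula for $\Phi$-entropies:

$$\mathbf{Ent}^\Phi_\mu(f) = \inf_{a \in \mathcal{I}} \mathbf{E}_\mu\!\left(\Phi(f) - \Phi(a) - \Phi'(a)(f - a)\right). \tag{45}$$

This formula is nothing else but a consequence of the Taylor formula for the convex function $\Phi$, and no
further assumption on $\Phi$ is required here. It can be found in [ABC+00, Lem. 3.4.2]. One can easily
recover the well known formulas for variance and entropy by considering the appropriate $\Phi$:

$$\mathbf{Var}_\mu(f) = \inf_{a \in \mathbb{R}} \mathbf{E}_\mu\!\left((f - a)^2\right)$$

and

$$\mathbf{Ent}_\mu(f^2) = \inf_{a > 0} \mathbf{E}_\mu\!\left(f^2 \log(f^2/a) - f^2 + a\right).$$

The variational formula (45) allows a perturbation of $\Phi$-Sobolev inequalities via the method used by
Holley and Stroock for Poincaré and logarithmic Sobolev inequalities, cf. [HS87]. Namely, let $\mu$ be a
probability measure on $(\Omega, \mathcal{F})$ and $B : (\Omega, \mathcal{F}) \to (\mathbb{R}, \mathcal{B}(\mathbb{R}))$ be a bounded measurable function. If we
define the probability measure $\nu_B$ on $(\Omega, \mathcal{F})$ by:

$$d\nu_B := Z_B^{-1} \exp(B)\, d\mu, \tag{46}$$

where $Z_B := \mathbf{E}_\mu(\exp(B))$, then, since the integrand below is non-negative by convexity of $\Phi$ and since $Z_B^{-1} \exp(B) \le \exp(\sup(B) - \inf(B))$, one can write:

$$\begin{aligned}
\mathbf{Ent}^\Phi_{\nu_B}(f) &= \inf_{a \in \mathcal{I}} \int \left(\Phi(f) - \Phi(a) - \Phi'(a)(f - a)\right) d\nu_B \\
&\le \exp\!\left(-\inf(B) + \sup(B)\right) \inf_{a \in \mathcal{I}} \int \left(\Phi(f) - \Phi(a) - \Phi'(a)(f - a)\right) d\mu \\
&= \exp(\mathrm{osc}(B))\, \mathbf{Ent}^\Phi_\mu(f).
\end{aligned}$$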

One can express the result as follows: 

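Proposition 17 (Perturbation). Let $(\Omega, \mathcal{F}, \mu)$ be a probability space such that

$$\exists c \in \mathbb{R}_+^*, \ \forall f \in \mathcal{A}, \quad \mathbf{Ent}^\Phi_\mu(f) \le c\, \mathbf{E}_\mu(\mathcal{E}(f)),$$

where $\mathcal{A}$ is a class of real valued measurable functions on $(\Omega, \mathcal{F})$ taking their values in $\mathcal{I}$ and where
$\mathcal{E} : \mathcal{A} \to L^1(\Omega, \mathcal{F}, \mu, \mathbb{R}_+)$ is a functional. Then, for any bounded measurable function $B : (\Omega, \mathcal{F}) \to \mathbb{R}$,
one has:

$$\forall f \in \mathcal{A}, \quad \mathbf{Ent}^\Phi_{\nu_B}(f) \le c\, e^{2\, \mathrm{osc}(B)}\, \mathbf{E}_{\nu_B}(\mathcal{E}(f)),$$

where $\nu_B$ is as in (46).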
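The two ingredients of the perturbation argument, namely the variational formula (45) and the comparison of $\Phi$-entropies under a bounded change of measure, can be checked on a small example. This is a sketch on a hypothetical four-point space with $\Phi(x) = x \log x$; the names `perturb` and `bregman_mean` and the data are assumptions made for the illustration.

```python
import math

def phi_entropy(phi, weights, values):
    """Ent^Phi(f) = E(Phi(f)) - Phi(E(f)) on a finite probability space."""
    mean = sum(w * v for w, v in zip(weights, values))
    return sum(w * phi(v) for w, v in zip(weights, values)) - phi(mean)

def perturb(mu, B):
    """Weights of the measure with density exp(B) / Z with respect to mu."""
    z = sum(m * math.exp(b) for m, b in zip(mu, B))
    return [m * math.exp(b) / z for m, b in zip(mu, B)]

def bregman_mean(phi, dphi, weights, values, a):
    """E(Phi(f) - Phi(a) - Phi'(a)(f - a)), the quantity minimised over a."""
    return sum(w * (phi(v) - phi(a) - dphi(a) * (v - a))
               for w, v in zip(weights, values))

def xlogx(x):
    return x * math.log(x)

def d_xlogx(x):
    return math.log(x) + 1.0

# Hypothetical data: uniform reference measure, bounded B, positive f.
mu = [0.25, 0.25, 0.25, 0.25]
B = [0.0, 0.3, -0.4, 0.8]
f = [0.5, 1.0, 2.0, 4.0]

nu = perturb(mu, B)
osc = max(B) - min(B)
ent_mu = phi_entropy(xlogx, mu, f)
ent_nu = phi_entropy(xlogx, nu, f)
bound = math.exp(osc) * ent_mu

# The infimum in the variational formula is attained at a = E(f).
mean_mu = sum(w * v for w, v in zip(mu, f))
at_mean = bregman_mean(xlogx, d_xlogx, mu, f, mean_mu)
elsewhere = [bregman_mean(xlogx, d_xlogx, mu, f, a) for a in (0.5, 1.0, 3.0)]
```

One observes `ent_nu <= bound`, that `at_mean` coincides with `ent_mu`, and that other values of `a` give larger quantities.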

3.4 Concentration of measure 

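It is well known that a logarithmic Sobolev inequality for $\mu$ on $\mathbb{R}^d$ of the form

$$\exists c > 0, \ \forall f : \mathbb{R}^d \to \mathbb{R} \text{ smooth}, \quad \mathbf{Ent}^{x \log x}_\mu(f^2) \le c\, \mathbf{E}_\mu\!\left(|\nabla f|^2\right)$$

gives, when applied to $f = \exp(\lambda F)$ where $F : \mathbb{R}^d \to \mathbb{R}$ is 1-Lipschitz, a Gaussian like exponential upper
bound for the Laplace transform of the law of $F$ under $\mu$: $\mathbf{E}_\mu(\exp(2\lambda (F - \mathbf{E}_\mu(F)))) \le \exp(c \lambda^2)$. Such a bound can then be used via the
classical Chebyshev-Markov-Chernoff-Cramér approach to give a concentration inequality for $F$ around
its mean:

$$\mu\!\left(|F - \mathbf{E}_\mu(F)| \ge t\right) \le 2 \exp(-t^2/c).$$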

The $t^2$ comes from the fact that the Young-Fenchel-Legendre convex conjugate of $x \mapsto p^{-1} x^p$ is $x \mapsto
q^{-1} x^q$ where $q := p/(p-1)$ is the Hölder conjugate of $p$, and thus $q = 2$ when $p = 2$. This method is
known as the Herbst argument and gives precise and non-asymptotic bounds which strengthen large deviations
results. The concentration bounds obtained from logarithmic Sobolev inequalities in discrete settings are only
Poissonian, i.e. $\exp(-c \min(t^2, t \log t))$, due to the lack of chain rule. At the opposite side of the set of
possible $\Phi$ functions, the Poincaré inequality gives by the same method an exponential like concentration,
i.e. $\exp(-ct)$. See for example [Led99, Led01] and [ABC+00, Chap. 7] for a general approach to
concentration of measure via functional inequalities. It is tempting to study the concentration of measure
consequences of generic $\Phi$-Sobolev inequalities, and one can expect intermediate exponential speeds
between $t^2$ and $t$. Let us recall what can be found in [LO00] and [Bar01]. Let $\mu$ be a probability measure
on $\mathbb{R}^d$, let $a \in [0,1]$ and $r = 2/(2 - a)$ (i.e. $r \in [1, 2]$). Assume that there exists a constant $C > 0$ such
that for any $q \in [1,2)$ and any smooth function $f : \mathbb{R}^d \to \mathbb{R}$:

$$\mathbf{E}_\mu(f^2) - \mathbf{E}_\mu(|f|^q)^{2/q} \le C\, (2 - q)^a\, \mathbf{E}_\mu\!\left(|\nabla f|^2\right).$$

We have seen already that such an inequality can be deduced by a change of function from
$\Phi$-Sobolev inequalities with $\Phi(x) = x^p$. Then, the following concentration of measure holds for any $t > 0$ and any
$\mu$-integrable 1-Lipschitz function $F : \mathbb{R}^d \to \mathbb{R}$:

$$\mu\!\left(F - \mathbf{E}_\mu(F) \ge \sqrt{C}\, t\right) \le \exp\!\left(-K \min(t^2, t^r)\right),$$

where $K > 0$ is a universal constant. See [Wan02, Bar01, LO00, BCR04] and references therein for further
developments. One can find a quite recent account in [Lug04]. Some aspects of concentration of measure
consequences in discrete space settings are addressed in [Wu00] and [Led99] and references therein.

Theorem 7 page 7 gives $\Phi$-Sobolev inequalities for hypercontractive diffusions and uniformly strictly
log-concave probability measures, i.e. under the Bakry-Emery $\Gamma_2$ criterion (27). It is quite natural to
ask for a general criterion to establish such inequalities beyond this scope. One can find some answers in
[LO00] and [Wan02] for $\Phi(x) = x^p$. Namely, let $a \in [0, 1]$ and $r = 2/(2 - a)$ (i.e. $r \in [1,2]$), and consider
the probability measure on $\mathbb{R}^d$ defined by:

$$d\nu_r(x) := Z_r^{-1} \exp\!\left(-\|x\|_r^r\right) dx.$$

Then there exists a universal constant $C > 0$ such that for any $q \in [1, 2)$ and any smooth function $f : \mathbb{R}^d \to \mathbb{R}$:

$$\mathbf{E}_{\nu_r}(f^2) - \mathbf{E}_{\nu_r}(|f|^q)^{2/q} \le C\, (2 - q)^a\, \mathbf{E}_{\nu_r}\!\left(|\nabla f|^2\right),$$

which is exactly, after the suitable change of functions, the $\Phi$-Sobolev inequality for $\Phi(x) = x^p$. One
can try to investigate $\Phi$-Sobolev inequalities on the real line via Hardy type inequalities, as is
known for Poincaré and logarithmic Sobolev inequalities, cf. for example [ABC+00, Chap. 6]. As we will
show in the sequel, $\Phi$-Sobolev inequalities can be established on paths space of some diffusions (under
$(\mathcal{H}1)$) and some Lévy processes (under $(\mathcal{H}2)$), extending in this way what is already known in the literature
for Poincaré and logarithmic Sobolev inequalities.

4 Phi-Sobolev for Brownian Motion and Wiener space 

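Consider the standard Brownian Motion $(B_t)_{t \ge 0}$ on $\mathbb{R}^d$ starting from $B_0 = 0$. Since $\mathcal{L}(B_t) = \mathcal{N}(0, t\, I_d)$
is uniformly strictly log-concave with constant $1/t$, it satisfies a $\Phi$-Sobolev inequality (24) of constant $t/2$
under $(\mathcal{H}1)$ for $\Phi$:

$$\forall t > 0, \ \forall f, \quad \mathbf{Ent}^\Phi(f(B_t)) \le \frac{t}{2}\, \mathbf{E}\!\left(\Phi''(f(B_t))\, |\nabla f|^2(B_t)\right).$$

Let $0 < t_1 < \cdots < t_n$ be $n$ successive times, and let $F : (\mathbb{R}^d)^n \to \mathbb{R}$ be a smooth function. Then, one can
write:

$$\begin{aligned}
\mathbf{Ent}^\Phi(F(B_{t_1}, B_{t_2}, \ldots, B_{t_n})) &= \mathbf{Ent}^\Phi(F(Q_1, Q_1 + Q_2, \ldots, Q_1 + \cdots + Q_n)) \\
&= \mathbf{Ent}^\Phi(G(Q_1, \ldots, Q_n)) \\
&= \mathbf{Ent}^\Phi_{\mathcal{L}(Q_1, \ldots, Q_n)}(G),
\end{aligned}$$

where $Q_i := B_{t_i} - B_{t_{i-1}}$ and $t_0 := 0$ and

$$G(x_1, \ldots, x_n) := F(x_1, x_1 + x_2, \ldots, x_1 + \cdots + x_n).$$

But now, since Brownian Motion has i.i.d. increments, one gets

$$\mathcal{L}(Q_1, \ldots, Q_n) = \mathcal{N}(0, (t_1 - t_0)\, I_d) \otimes \cdots \otimes \mathcal{N}(0, (t_n - t_{n-1})\, I_d).$$

Therefore, the tensorisation property (41) yields the following result.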

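Theorem 18 (Multi-times $\Phi$-Sobolev for Brownian Motion). Let $(B_t)_{t \ge 0}$ be a standard Brownian
Motion on $\mathbb{R}^d$. Assume that $\Phi$ fulfils $(\mathcal{H}1)$. Then for any sequence of times $t_0 := 0 < t_1 < \cdots < t_n$ and
any smooth function $F : (\mathbb{R}^d)^n \to \mathcal{I}$:

$$\mathbf{Ent}^\Phi(F(B_{t_1}, \ldots, B_{t_n})) \le \frac{1}{2}\, \mathbf{E}\!\left(\Phi''(F(B_{t_1}, \ldots, B_{t_n}))\, |\nabla F|_{\mathbf{t}}^2(B_{t_1}, \ldots, B_{t_n})\right), \tag{47}$$

where

$$|\nabla F|_{\mathbf{t}}^2 := \sum_{i=1}^{n} (t_i - t_{i-1}) \Big| \sum_{j=i}^{n} \nabla_j F \Big|^2, \tag{48}$$

and $\nabla_j F$ denotes the gradient of $F$ with respect to its $j$-th variable. Moreover, the inequality remains true when $t_0 := 0 \le t_1 \le \cdots \le t_n$.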
One can ask if (47) is a consequence of (30). If we define the $n \times n$ square matrices $T$ and $Q$ by
$T := \mathrm{diag}(\sqrt{t_1 - t_0}, \ldots, \sqrt{t_n - t_{n-1}})$ and $Q_{ij} := \mathbf{1}_{\{j \ge i\}}$, then $\Sigma := (TQ)^\top TQ$ is the covariance matrix of
the centred Gaussian vector $(B_{t_1}, \ldots, B_{t_n})$. In fact, $\Sigma_{ij} = t_i \wedge t_j$, and the $\langle \Sigma v; v \rangle$ quadratic form in the
right hand side of (30) reads

$$\langle \Sigma v; v \rangle = (TQv)^\top TQv = \sum_{i=1}^{n} (t_i - t_{i-1}) \Big| \sum_{j=i}^{n} v_j \Big|^2, \tag{49}$$

which is exactly the quadratic form appearing in the right hand side (48) of (47). Therefore, for Gaussian
measures, the $\Phi$-Sobolev inequality (47) obtained by applying the tensorisation formula (41) to the
unidimensional case (29) (i.e. with $d = 1$) is exactly the Brascamp-Lieb type $\Phi$-Sobolev inequality (30).
The direct use of (29) for the multivariate Gaussian measure $\mathcal{L}(B_{t_1}, \ldots, B_{t_n})$ gives the quadratic form
$(\max \sigma(\Sigma))\, |v|^2$ where $\sigma(\Sigma)$ is the spectrum of $\Sigma$. Thus, inequality (29) for Gaussian measures
is a direct consequence of (47). Notice that $\det(\Sigma) = (t_1 - t_0) \cdots (t_n - t_{n-1})$ and that $\mathrm{Tr}(\Sigma) = t_1 + \cdots + t_n$.
As we will see, inequality (47) appears as a particular case of a more general one on paths space (i.e.
for the Wiener measure). The same procedure may be used for any random walk (resp. Lévy process),
provided that the law of the increment (resp. the infinitely divisible law at time $t = 1$) satisfies a
$\Phi$-Sobolev inequality under $(\mathcal{H}1)$ for $\Phi$ (we need a tensorisation property). Infinitely divisible laws are
particular cases of laws of sums of i.i.d. random variables. As we will see, one can use for such laws the
semi-group approach directly via the associated Lévy process, to establish $\Phi$-Sobolev inequalities with
linear constant in time when $\Phi$ satisfies $(\mathcal{H}2)$, just like for Brownian Motion.
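The matrix identities used in this comparison are elementary and can be verified on an example. The sketch below is an illustration in dimension $d = 1$; the list `times`, the vector `v` and the helper names are hypothetical. It builds $TQ$, forms $(TQ)^\top TQ$, and compares it with the Brownian covariance $t_i \wedge t_j$ and with the sum-of-squares form of the quadratic form.

```python
import math

def brownian_factor(times):
    """Matrix TQ for times t_1 < ... < t_n (with t_0 = 0):
    entry (i, j) is sqrt(t_i - t_{i-1}) when j >= i, and 0 otherwise."""
    n = len(times)
    steps = [times[0]] + [times[i] - times[i - 1] for i in range(1, n)]
    return [[math.sqrt(steps[i]) if j >= i else 0.0 for j in range(n)]
            for i in range(n)]

def gram(a):
    """Return the matrix product transpose(a) * a for a square matrix a."""
    n = len(a)
    return [[sum(a[k][i] * a[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

# Hypothetical times and test vector.
times = [0.5, 1.0, 2.5]
v = [1.0, -2.0, 0.5]
n = len(times)

sigma = gram(brownian_factor(times))
expected = [[min(times[i], times[j]) for j in range(n)] for i in range(n)]
trace = sum(sigma[i][i] for i in range(n))

# Quadratic form computed from sigma, and from the sum-of-squares expression.
quad_from_sigma = sum(sigma[i][j] * v[i] * v[j] for i in range(n) for j in range(n))
steps = [times[0]] + [times[i] - times[i - 1] for i in range(1, n)]
quad_from_steps = sum(steps[i] * sum(v[i:]) ** 2 for i in range(n))
```

The matrix `sigma` matches `expected` entrywise, `trace` equals the sum of the times, and the two expressions of the quadratic form agree.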

One can derive the $\Phi$-Sobolev inequality for the Wiener measure on $W_0(\mathbb{R}^d)$ by a cylindrical method,
starting from the multi-times $\Phi$-Sobolev inequality (47) (established by tensorisation) and letting the
number $n$ of times considered tend to $+\infty$. However, $\Phi$-Sobolev inequalities involving a Malliavin
derivative can be easily derived on paths space via a martingale representation approach and the Itô formula,
as was done for the log-Sobolev inequality in [CHL97]. Actually, one can state the following
Theorem.

Theorem 19 ($\Phi$-Sobolev for the Wiener measure). Let $W_0(\mathbb{R}^d)$ be the paths space of continuous
functions from $[0,1]$ to $\mathbb{R}^d$ starting from 0 at $t = 0$, equipped with the standard Wiener measure. Assume
that $\Phi$ fulfils $(\mathcal{H}1)$. Then for any random variable $F \in L^2(W_0(\mathbb{R}^d), \mathcal{I})$:

$$\mathbf{Ent}^\Phi(F) \le \frac{1}{2}\, \mathbf{E}\!\left(\Phi''(F)\, |DF|_{\mathbb{H}}^2\right), \tag{50}$$

where $D : L^2(\mu) \to L^2(\mu, \mathbb{H})$ is the Malliavin gradient operator on $W_0(\mathbb{R}^d)$ and $\mathbb{H}$ is the Cameron-Martin
Hilbert space.

Proof. For every functional $F : W_0(\mathbb{R}^d) \to \mathcal{I}$, consider the martingale $M_t := \mathbf{E}(F \mid \mathcal{F}_t)$ where $0 \le t \le 1$
and $\mathcal{F}_t = \sigma(B_s, \ 0 \le s \le t)$ is the natural filtration. Then, one has

$$dM_t = \langle \mathbf{E}((DF)_t \mid \mathcal{F}_t), dB_t \rangle,$$

where $(DF)_t$ denotes the directional derivative of $F$. Now, the Itô formula yields:

$$\mathbf{E}(\Phi(M_1)) - \mathbf{E}(\Phi(M_0)) = \frac{1}{2}\, \mathbf{E}\!\left(\int_0^1 \Phi''(M_t)\, |\mathbf{E}((DF)_t \mid \mathcal{F}_t)|^2\, dt\right),$$

which is nothing else but

$$\mathbf{Ent}^\Phi(F) = \frac{1}{2}\, \mathbf{E}\!\left(\int_0^1 \Phi''(\mathbf{E}(F \mid \mathcal{F}_t))\, |\mathbf{E}((DF)_t \mid \mathcal{F}_t)|^2\, dt\right).$$

Now, by the Jensen inequality for the bivariate convex function $(u, v) \mapsto \Phi''(u)\, |v|^2$ we get as in the semi-group
proof of Theorem 7:

$$\Phi''(\mathbf{E}(F \mid \mathcal{F}_t))\, |\mathbf{E}((DF)_t \mid \mathcal{F}_t)|^2 \le \mathbf{E}\!\left(\Phi''(F)\, |(DF)_t|^2 \mid \mathcal{F}_t\right),$$

which gives the desired result. Here again, one can use alternatively the concavity of $1/\Phi''$ and the Cauchy-Schwarz
inequality. □


Notice that the proof is a replica at paths space level of the semi-group proof for diffusions in Theorem
7, where $P_t$ is "replaced" by $\mathbf{E}(F \mid \mathcal{F}_t)$ and the diffusion property by the Itô formula. This analogy is not
merely formal, since the diffusion semi-group gives the solution of the Stroock-Varadhan martingale problem
associated to the related elliptic diffusion operator, see for example [KS91, Chap. 5, Sect. 4]. Such
a proof can be extended to paths spaces on manifolds, as was already done for the logarithmic
Sobolev inequality in [CHL97, Wan96] (see also [Hsu02, Sect. 8.3] and references therein), as stated in
the following Theorem.

Theorem 20 ($\Phi$-Sobolev for Brownian Motion on a Manifold). Let $(\mathcal{M}, g)$ be a smooth complete
and connected Riemannian manifold equipped with the Levi-Civita connection. Let $x \in \mathcal{M}$ and $W_x(\mathcal{M})$
be the space of continuous paths $\gamma : [0, 1] \to \mathcal{M}$ with $\gamma(0) = x$. Assume that the Ricci curvature of $\mathcal{M}$ is
uniformly bounded by the real number $K$. Assume that $\Phi$ fulfils $(\mathcal{H}1)$, then for any smooth non-negative
random variable $F$ on $W_x(\mathcal{M})$:

$$\mathbf{Ent}^\Phi(F) \le \frac{1}{2}\, e^{K}\, \mathbf{E}\!\left(\Phi''(F)\, |DF|_{\mathbb{H}}^2\right). \tag{51}$$

When $\mathcal{M}$ is the standard $d$-dimensional Euclidean space $\mathbb{R}^d$, one has $K = 0$ and we recover Theorem
19. We believe that $\Phi$-Sobolev inequalities under $(\mathcal{H}1)$ still hold on the paths space of diffusions on
manifolds with the Driver total antisymmetry condition, as was done in [CHL97] for the logarithmic
Sobolev inequality. We have to mention that this beautiful martingale method for the Wiener measure over a
Riemannian manifold appeared for the first time in [Fan94] for the Poincaré inequality. We will use
roughly the same method under $(\mathcal{H}2)$ on Poisson space in Section 5.1 page 20.

5 Phi-Sobolev for pure jumps Levy processes and Poisson space 

It is tempting to try to establish $\Phi$-Sobolev inequalities on Poisson space. In particular, Lévy processes
have i.i.d. increments like Brownian Motion. Such processes are not diffusions, and the lack of the chain
rule (10) forbids the direct use of the Bakry-Emery semi-group proof. More precisely, for all Markov
processes, $\Gamma \ge 0$, and the $CD(\rho, \infty)$ criterion is equivalent to $\Gamma_2 \ge \rho\, \Gamma$. But the equivalent form in terms
of strong commutation (25) between the semi-group and $\Gamma$ is available only for diffusions. Nevertheless,
for Levy processes, one has I 2 > 0 and indeed I 2 ^ tVt, which gives Brownian Motion like commutation 
formulas and then allows one to derive respectively Poincaré and log-Sobolev inequalities, see [AL00] for
the simple Poisson point process and [CM02] for more general Lévy processes. More general approaches
are presented in [Wu00] (generic Poisson space) and [Pri00] (normal martingales), but here again only
for Poincaré and log-Sobolev inequalities.

Let us start with the simplest pure jumps Lévy process $(X_t)_{t \ge 0}$, which is the simple Poisson point
process with intensity $\lambda > 0$, for which

$$(Lf)(x) := \lambda\, (f(x + 1) - f(x)) =: \lambda\, (D_1 f)(x), \tag{52}$$

where $(D_y f)(x) := f(x + y) - f(x)$. If $g := P_{t-s}(f)$, one has:

$$\partial_s P_s(\Phi(g)) = P_s\!\left(L\Phi(g) - \Phi'(g)\, Lg\right) = \lambda\, P_s\!\left(\Psi(g, D_1 g)\right),$$

where $\Psi$ is defined by (19). But now, under $(\mathcal{H}2)$, $\Psi$ is bivariate convex on its domain. This was already observed
for the simple cases $\Phi(x) = x^2$ and $\Phi(x) = x \log x$ in the nice paper [Wu00]. Hence, by the bivariate
Jensen inequality and the commutativity property between $D_1$ and $P_{t-s}$ (due to the i.i.d. nature of the
increments), one gets the following result.

Theorem 21 (Local $\Phi$-Sobolev for the simple Poisson point process). Let $(X_t)_{t \ge 0}$ be the simple
Poisson point process on $\mathbb{R}$ with intensity $\lambda > 0$. Assume that $\Phi$ fulfils $(\mathcal{H}2)$, then for any $t > 0$ and any
smooth $f : \mathbb{R} \to \mathcal{I}$:

$$\mathbf{Ent}^\Phi_{P_t}(f) \le \lambda\, t\, P_t\!\left(\Psi(f, D_1 f)\right), \tag{53}$$

where $P_t(f)(x) := \mathbf{E}(f(X_t) \mid X_0 = x)$ is the associated semi-group as in (4).

In particular, by taking $t = 1$, we get:

Corollary 22 ($\Phi$-Sobolev for the simple Poisson measure). Let $\mathcal{P}_\lambda$ be the Poisson measure of
mean $\lambda > 0$, then, under $(\mathcal{H}2)$:

$$\mathbf{Ent}^\Phi_{\mathcal{P}_\lambda}(f) \le \lambda\, \mathbf{E}_{\mathcal{P}_\lambda}\!\left(\Psi(f, D_1 f)\right). \tag{54}$$
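The inequality for the Poisson measure can be illustrated numerically for $\Phi(x) = x^2$ and $\Phi(x) = x \log x$. The sketch below makes several assumptions that should be read as such: it takes $\Psi(u, v) = \Phi(u+v) - \Phi(u) - \Phi'(u)\, v$, which is the expression obtained by expanding $L\Phi(g) - \Phi'(g) Lg$ for the generator (52); it truncates the Poisson weights at a finite level (the neglected tail is far below rounding precision for the chosen mean); and the test function and helper names are hypothetical.

```python
import math

def poisson_weights(lam, size):
    """Poisson(lam) probabilities of 0, ..., size - 1 (the tail is dropped)."""
    weights = [math.exp(-lam)]
    for k in range(1, size):
        weights.append(weights[-1] * lam / k)
    return weights

def psi(phi, dphi, u, v):
    """Psi(u, v) = Phi(u + v) - Phi(u) - Phi'(u) v (assumed form of (19))."""
    return phi(u + v) - phi(u) - dphi(u) * v

def poisson_sobolev_sides(phi, dphi, f, lam, size=80):
    """Return (Phi-entropy of f, lam * E(Psi(f, D_1 f))) under Poisson(lam)."""
    w = poisson_weights(lam, size)
    mean = sum(w[k] * f(k) for k in range(size))
    lhs = sum(w[k] * phi(f(k)) for k in range(size)) - phi(mean)
    rhs = lam * sum(w[k] * psi(phi, dphi, f(k), f(k + 1) - f(k))
                    for k in range(size))
    return lhs, rhs

def f(k):
    # Hypothetical positive bounded test function on the non-negative integers.
    return 1.0 + 1.0 / (1.0 + k)

lam = 2.0
var_lhs, var_rhs = poisson_sobolev_sides(lambda x: x * x, lambda x: 2.0 * x, f, lam)
ent_lhs, ent_rhs = poisson_sobolev_sides(lambda x: x * math.log(x),
                                         lambda x: math.log(x) + 1.0, f, lam)
# For Phi(x) = x^2 and f(k) = k both sides equal lam: the constant is attained.
id_lhs, id_rhs = poisson_sobolev_sides(lambda x: x * x, lambda x: 2.0 * x,
                                       lambda k: float(k), lam)
```

In both cases the left-hand side is below the right-hand side, and the identity function gives equality for the variance.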
Such a result remains true for pure jumps Lévy processes (i.e. without Brownian part) by replacing
$D_1$ by the appropriate jump integral. The Brownian part may be added at the final stage via tensorisation
when possible. Namely, we can state the following.

Theorem 23 (Local $\Phi$-Sobolev for a pure jumps Lévy process). Let $(X_t)_{t \ge 0}$ be a pure jumps Lévy
process on $\mathbb{R}^d$ with infinitesimal generator of the form:

$$(Lf)(x) := \lambda \int \left( D_y f(x) - \theta\, \frac{\langle y, \nabla f(x) \rangle}{1 + |y|^2} \right) d\nu(y), \tag{55}$$

where $\lambda \in \mathbb{R}_+$ and $\theta$ are parameters and where $\nu$ is a Lévy measure on $\mathbb{R}^d$. Assume that $\Phi$ fulfils $(\mathcal{H}2)$. Then for any
$t > 0$ and any smooth $f : \mathbb{R}^d \to \mathcal{I}$:

$$\mathbf{Ent}^\Phi_{P_t}(f) \le \lambda\, t\, P_t\!\left(\int \Psi(f, D_y f)\, d\nu(y)\right), \tag{56}$$

where $P_t(f)(x) := \mathbf{E}(f(X_t) \mid X_0 = x)$ is the associated semi-group as in (4).

Taking $t = 1$ in (56) gives the same functional inequality for the infinitely divisible law $\mathcal{L}(X_1)$, exactly
like in Corollary 22 for the simple Poisson measure. Theorem 21 for the simple Poisson point process can
be recovered by taking $(\theta, \nu) = (0, \delta_1)$ in Theorem 23.

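One can state the following multi-times version of Theorem 23 via the so called Lu-Yau-Stroock-Zegarliński
Markov decomposition method.

Theorem 24 (Multi-times $\Phi$-Sobolev for pure jumps Lévy proc.). Let $(X_t)_{t \ge 0}$ be a pure jumps
Lévy process on $\mathbb{R}^d$ as in Theorem 23. Assume that $\Phi$ fulfils $(\mathcal{H}2)$. Then for any increasing sequence of
times $t_0 := 0 < t_1 < \cdots < t_n$ and any smooth $F : (\mathbb{R}^d)^n \to \mathcal{I}$:

$$\mathbf{Ent}^\Phi(F(X_{t_1}, \ldots, X_{t_n})) \le \lambda\, \mathbf{E}\!\left(\mathcal{E}^\Phi(F)(X_{t_1}, \ldots, X_{t_n})\right), \tag{57}$$

where

$$\mathcal{E}^\Phi(F) := \sum_{i=1}^{n} (t_i - t_{i-1}) \int \Psi\!\left(F, D_y^{i, \ldots, n} F\right) d\nu(y),$$

where for any $x \in (\mathbb{R}^d)^n$

$$D_y^{i, \ldots, n} F(x) := F \circ \tau_i(y)(x) - F(x),$$

and where for any $i \in \{1, \ldots, n\}$, any $x \in (\mathbb{R}^d)^n$ and any $y \in \mathbb{R}^d$

$$F \circ \tau_i(y)(x) := F(x_1, \ldots, x_{i-1}, x_i + y, \ldots, x_n + y).$$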

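Proof. By induction on $n$, we can restrict the problem to the case $n = 2$. Let $s$ and $t$ be two distinct
times with $s < t$. We assume that for any $u > 0$, a $\Phi$-Sobolev inequality holds for $\mathcal{L}(X_u)$ (i.e. for $P_u$)
with constant $\lambda u$. We would like to obtain a similar inequality for $\mathcal{L}(X_s, X_t)$. Let $F : (\mathbb{R}^d)^2 \to \mathcal{I}$ be a
smooth bivariate functional. We start with the following conditional decomposition of the $\Phi$-entropy
(which replaces in some ways the tensorisation property):

$$\begin{aligned}
\mathbf{Ent}^\Phi(F(X_s, X_t)) = {} & \mathbf{E}\!\left[\mathbf{E}(\Phi(F) \mid X_s) - \Phi(\mathbf{E}(F \mid X_s))\right] \\
& + \mathbf{E}(\Phi(\mathbf{E}(F \mid X_s))) - \Phi(\mathbf{E}(\mathbf{E}(F \mid X_s))),
\end{aligned}$$

where we abridged $F(X_s, X_t)$ in $F$ in the right hand side. In other words,

$$\mathbf{Ent}^\Phi(X) = \mathbf{E}(\mathbf{Ent}^\Phi(X \mid Y)) + \mathbf{Ent}^\Phi(\mathbf{E}(X \mid Y)), \tag{58}$$

where $\mathbf{Ent}^\Phi(X \mid Y)$ is the conditional $\Phi$-entropy defined by

$$\mathbf{Ent}^\Phi(X \mid Y) := \mathbf{E}(\Phi(X) \mid Y) - \Phi(\mathbf{E}(X \mid Y)).$$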
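The conditional decomposition of the $\Phi$-entropy is an exact identity, which can be seen on any finite joint law. The sketch below is only an illustration: the joint weights, the values of $X$ and the function name `conditional_decomposition` are hypothetical. Each row of `joint` corresponds to one value of the conditioning variable $Y$.

```python
import math

def conditional_decomposition(phi, joint, values):
    """joint[y][k] is the probability of the k-th cell of the event {Y = y},
    values[y][k] is the value of X on that cell.
    Returns (Ent(X), E(Ent(X | Y)), Ent(E(X | Y)))."""
    cells = [(w, x) for row_w, row_x in zip(joint, values)
             for w, x in zip(row_w, row_x)]
    mean = sum(w * x for w, x in cells)
    ent_x = sum(w * phi(x) for w, x in cells) - phi(mean)

    mean_cond_ent = 0.0      # E(Ent(X | Y))
    mean_phi_cond = 0.0      # E(Phi(E(X | Y)))
    for row_w, row_x in zip(joint, values):
        p_y = sum(row_w)
        m_y = sum(w * x for w, x in zip(row_w, row_x)) / p_y
        cond_ent = sum(w * phi(x) for w, x in zip(row_w, row_x)) / p_y - phi(m_y)
        mean_cond_ent += p_y * cond_ent
        mean_phi_cond += p_y * phi(m_y)
    ent_cond_mean = mean_phi_cond - phi(mean)
    return ent_x, mean_cond_ent, ent_cond_mean

# Hypothetical joint law: Y takes two values, X takes 2 or 3 values given Y.
joint = [[0.1, 0.2], [0.3, 0.15, 0.25]]
values = [[1.0, 3.0], [0.5, 2.0, 4.0]]

ent_x, mean_cond_ent, ent_cond_mean = conditional_decomposition(
    lambda x: x * math.log(x), joint, values)
```

The first returned quantity equals the sum of the other two up to rounding, and both terms of the decomposition are non-negative by the Jensen inequality.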

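Now, if $G_x(y) := F(x, x + y)$, we have for any $x$:

$$\mathbf{Ent}^\Phi(G_x(X_{t-s})) \le \lambda\, (t - s)\, \mathbf{E}\!\left(\int_{\mathbb{R}^d} \Psi\!\left(G_x(X_{t-s}), D_z G_x(X_{t-s})\right) d\nu(z)\right).$$

Therefore, since $X_t = X_s + (X_t - X_s)$ and $\mathcal{L}(X_s, X_t - X_s) = \mathcal{L}(X_s) \otimes \mathcal{L}(X_{t-s})$, the first term of the $\mathbf{Ent}^\Phi$
decomposition can be bounded above as

$$\mathbf{E}\!\left[\mathbf{E}(\Phi(F) \mid X_s) - \Phi(\mathbf{E}(F \mid X_s))\right] \le \lambda\, (t - s)\, \mathbf{E}\!\left(\int_{\mathbb{R}^d} \Psi\!\left(F, D_z^{2} F\right) d\nu(z)\right),$$

where the 2 exponent in $D_z^{2}$ means that the $z$ translation is done on the second variable of $F$ in the
definition of $D_z$. On the other hand, if we define $H_{t-s}$ by

$$H_{t-s}(x) := \mathbf{E}(F(x, x + X_{t-s})),$$

we have:

$$\mathbf{Ent}^\Phi(H_{t-s}(X_s)) \le \lambda\, s\, \mathbf{E}\!\left(\int_{\mathbb{R}^d} \Psi\!\left(H_{t-s}(X_s), D_z H_{t-s}(X_s)\right) d\nu(z)\right).$$

Now, by the commutation formula $D_z H_{t-s}(x) = \mathbf{E}(D_z(x \mapsto F(x, x + X_{t-s})))$ and the bivariate Jensen
inequality, the last term of the $\mathbf{Ent}^\Phi$ decomposition can be bounded above as follows:

$$\mathbf{E}(\Phi(\mathbf{E}(F \mid X_s))) - \Phi(\mathbf{E}(\mathbf{E}(F \mid X_s))) \le \lambda\, s\, \mathbf{E}\!\left(\int_{\mathbb{R}^d} \Psi\!\left(F, D_z^{1,2} F\right) d\nu(z)\right),$$

where this time the translation is done in both variables. □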


In some sense, the Markovian decomposition used in the proof of Theorem 24 replaces the tensorisation
property (41) when $(\mathcal{H}2)$ holds instead of $(\mathcal{H}1)$. It was introduced in Statistical Mechanics for Poincaré
and logarithmic Sobolev inequalities for finite volume Boltzmann-Gibbs measures related to spin systems,
cf. for example [GZ03] and [Roy99].

As for Brownian Motion, one can use a cylindrical method, letting $n$ tend to $+\infty$ in (57), in order to
obtain a $\Phi$-Sobolev inequality on paths space, as expressed in the following Theorem.

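Theorem 25 ($\Phi$-Sobolev on paths space of pure jumps Lévy proc.). Let $(X_t)_{t \ge 0}$ be a pure jumps
Lévy process as in Theorem 23. Assume that $\Phi$ fulfils $(\mathcal{H}2)$. Then, for any suitable function $F$ of $(X_t)_{t \ge 0}$
taking its values in $\mathcal{I}$ and any $T > 0$:

$$\mathbf{Ent}^\Phi(F) \le \lambda\, \mathbf{E}\!\left(\int_0^T \!\! \int \Psi\!\left(F, D_y^t F\right) d\nu(y)\, dt\right), \tag{59}$$

where

$$D_y^t F\!\left((x_s)_{0 \le s \le T}\right) := F\!\left((x_s + y\, \mathbf{1}_{[t,T]}(s))_{0 \le s \le T}\right) - F\!\left((x_s)_{0 \le s \le T}\right).$$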

Obviously, Theorem 25 is stronger than Theorem 23 since the latter can be recovered by simply
considering in (59) functions $F$ of the form

$$F\!\left((x_s)_{0 \le s \le T}\right) = G(x_{t_1}, \ldots, x_{t_n})$$

where $T$ is chosen bigger than $t_n$. Inequality (57) is similar to the one obtained for Brownian Motion (47)
via tensorisation. Actually, one can use the Lu-Yau-Stroock-Zegarliński Markovian method for Brownian
Motion. Generic Lévy processes, and thus Brownian Motion, are particular examples of normal
martingales and we believe that the approach used in [Pri00] for logarithmic Sobolev inequalities remains
valid for $\Phi$-Sobolev inequalities, but one has to make precise the condition on $\Phi$.

As explained below, $\Phi$-Sobolev inequalities under $(\mathcal{H}2)$ can be established on paths space for generic
Poisson space via a Clark-Ocone-Haussmann formula.


5.1 Phi-Sobolev inequalities on Poisson space

Let us explain finally how one can recover the $\Phi$-Sobolev inequality on Poisson space via a Clark-Ocone-Haussmann
formula, as is done in [Wu00] for the logarithmic Sobolev and Poincaré inequalities
(see also [Pri00]). Following closely [Wu00], let $(E, \mathcal{B}, \nu)$ be a measured space where $\nu$ is a $\sigma$-finite
measure and $(W, \mathcal{F}, \mathcal{P})$ the associated Poisson space with compensation measure $\nu$. Let also $(\Omega, \mathcal{C}, \mathbf{P})$
be the Poisson space associated to $([0,1] \times E, \ \mathcal{B}([0,1]) \otimes \mathcal{B}, \ \mu(dt, dz) := dt \otimes d\nu(z))$, with compensation
measure $\mu$. One can then define the difference operators $D_z : L^0(W, \mathcal{P}) \to L^0(E \times W, \nu \otimes \mathcal{P})$ and
$D_{t,z} : L^0(\Omega, \mathbf{P}) \to L^0([0, 1] \times E \times \Omega, \mu \otimes \mathbf{P})$ by:

$$D_z F(w) := F(w + \delta_z) - F(w) \quad \text{and} \quad D_{t,z} F(\omega) := D_z F(\omega),$$
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where F{u;) := F(a;([0, l],dz)). For any t G [0,1], we define 

Ct :=a{uj{A)-A&B{%t])(S)B). 

Let now $G \in L^2(\Omega, P)$ and let $g(t,z,\omega)$ be a predictable $dt \otimes d\nu(z) \otimes P(d\omega)$ version of
$\mathbf{E}_P(D_{t,z}G \mid \mathcal{C}_t)$. One has then the following Clark-Ocone type predictable representation formula:

\[ G = \mathbf{E}_P(G) + \int_0^1 \int_E g(t,z,\cdot)\, d\tilde{\omega}(t,z), \tag{60} \]

where $\tilde{\omega} = \omega - \mu$ is the compensated Poisson point process and where the integral is taken in the sense
of Itô. Let us assume for convenience that $G > \inf(\mathcal{I})$ and let us define the right continuous martingale
$(M_t)_{t \in [0,1]}$ by $M_t := \mathbf{E}_P(G \mid \mathcal{C}_t)$. Now, since $M_{t-} = M_t$ $dt \otimes P$-a.s., the Itô formula for jump
processes (cf. [IW89]) gives:

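\begin{align*}
\mathbf{Ent}^\Phi_P(G) &= \mathbf{E}_P(\Phi(M_1) - \Phi(M_0)) \\
&= \mathbf{E}_P\left(\int_0^1 \int_E \Psi(M_t, g(t,z))\, dt\, d\nu(z)\right).
\end{align*}

Now, it remains to use the bivariate convexity of $\Psi$ which comes from (H2) in order to get via Jensen
inequality and (60) that:

\[ \Psi(M_t, g(t,z)) = \Psi(\mathbf{E}_P(G \mid \mathcal{C}_t), \mathbf{E}_P(D_{t,z}G \mid \mathcal{C}_t)) \le \mathbf{E}_P(\Psi(G, D_{t,z}G) \mid \mathcal{C}_t), \]

which gives finally:

\[ \mathbf{Ent}^\Phi_P(G) \le \mathbf{E}_P\left(\int_0^1 \int_E \Psi(G, D_{t,z}G)\, dt\, d\nu(z)\right). \tag{61} \]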

The time interval $[0,1]$ can be easily replaced by $[0,T]$. One can recover Theorems 21, 24 and 25 as
Corollaries, in exactly the same way as in [Wu00] for Poincaré and logarithmic Sobolev inequalities.
Additionally, one can derive by the same method F.K.G. inequalities, as is done in [Wu00] for
Poisson space and in [Hu00] for diffusions. As we have seen, the method used here for Poisson space is a
replica of the method used to establish Theorem 19 concerning Wiener measure. The major difference is
the lack of chain rule in discrete space settings, which leads to replacing (H1) by (H2).
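
As a rough numerical illustration of the bivariate convexity used in the Jensen step above, the following Python sketch samples random pairs of points and checks the midpoint convexity inequality for $\Psi(u,v) = \Phi(u+v) - \Phi(u) - \Phi'(u)v$ with $\Phi(x) = x\log x$. The choice of $\Phi$, the sampling ranges and the helper names (`psi`, `midpoint_gap`, `random_point`) are assumptions made for this illustration only, and a finite sample is of course not a proof.

```python
import math
import random

def phi(x):
    """Phi(x) = x log x on (0, +inf)."""
    return x * math.log(x)

def dphi(x):
    """Phi'(x) = log x + 1."""
    return math.log(x) + 1.0

def psi(u, v):
    """Psi(u, v) = Phi(u + v) - Phi(u) - Phi'(u) v, for u > 0 and u + v > 0."""
    return phi(u + v) - phi(u) - dphi(u) * v

def midpoint_gap(p, q):
    """(Psi(p) + Psi(q)) / 2 - Psi((p + q) / 2); non-negative when Psi is convex."""
    mid = ((p[0] + q[0]) / 2.0, (p[1] + q[1]) / 2.0)
    return 0.5 * (psi(*p) + psi(*q)) - psi(*mid)

def random_point(rng):
    """Draw (u, v) with u > 0 and u + v > 0 (hypothetical sampling range)."""
    u = rng.uniform(0.1, 5.0)
    w = rng.uniform(0.1, 5.0)  # w plays the role of u + v
    return (u, w - u)

rng = random.Random(0)
worst_gap = min(
    midpoint_gap(random_point(rng), random_point(rng)) for _ in range(10000)
)
print("smallest midpoint gap over the sample:", worst_gap)
```

The sampled region $\{u > 0,\ u + v > 0\}$ is convex, so every midpoint stays in the domain of $\Psi$; up to rounding errors the smallest gap should not be negative.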


5.2 Some remarks about discrete Dirichlet forms 

We collect here a few remarks about (H2), (H2'), and comparisons with standard Dirichlet forms in
discrete space settings.

Remark 26 (Comparison with classical Dirichlet forms). Assume that (H2') holds. Since $\Phi'$ is
concave, $\Phi''$ is non-increasing and therefore, when $v \ge 0$,

\[ \Psi(u,v) = \int_u^{u+v} (u + v - w)\,\Phi''(w)\,dw \le \Phi''(u)\,v^2. \]

The case $v \le 0$ is very similar, and we get finally that under (H2'):

\[ \Psi(u,v) \le \Phi''(u)\,v^2 \quad \text{on the domain of } \Psi. \tag{62} \]


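On the other hand, if we assume only (H2), one can write:

\[ \Psi(u,v) = \int_u^{u+v} (u + v - w)\,\Phi''(w)\,dw \le v \int_u^{u+v} \Phi''(w)\,dw = v\,(\Phi'(u+v) - \Phi'(u)), \]

which gives that under (H2):

\[ \Psi(u,v) \le v\,(\Phi'(u+v) - \Phi'(u)) \quad \text{on the domain of } \Psi. \tag{63} \]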


Thus, under (H2') (resp. (H2)), we recover for example the well known $(Df)^2/f$ (resp. $Df\,D\log f$)
Dirichlet forms when $\Phi(x) = x\log x$, as in [Wu00]. When $\Phi(x) = x^2$, both of them give $2(Df)^2$.
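
The two comparisons of this remark can be checked numerically in the case $\Phi(x) = x\log x$, for which $\Phi''(u)v^2 = v^2/u$ and $v(\Phi'(u+v) - \Phi'(u)) = v(\log(u+v) - \log u)$. The Python sketch below is only a sanity check on random points; the function names `form_62` and `form_63`, the sampling ranges and the tolerance are assumptions introduced for this illustration.

```python
import math
import random

def psi(u, v):
    """Psi(u, v) = Phi(u+v) - Phi(u) - Phi'(u) v with Phi(x) = x log x."""
    return (u + v) * math.log(u + v) - u * math.log(u) - (math.log(u) + 1.0) * v

def form_62(u, v):
    """Right-hand side of (62): Phi''(u) v^2 = v^2 / u."""
    return v * v / u

def form_63(u, v):
    """Right-hand side of (63): v (Phi'(u+v) - Phi'(u)) = v (log(u+v) - log u)."""
    return v * (math.log(u + v) - math.log(u))

rng = random.Random(1)
samples = []
for _ in range(10000):
    u = rng.uniform(0.1, 5.0)
    w = rng.uniform(0.1, 5.0)  # w = u + v, so v may have either sign
    samples.append((u, w - u))

# Small tolerance to absorb floating point rounding near v = 0.
ok_62 = all(psi(u, v) <= form_62(u, v) + 1e-12 for u, v in samples)
ok_63 = all(psi(u, v) <= form_63(u, v) + 1e-12 for u, v in samples)
print(ok_62, ok_63)
```

For this particular $\Phi$, both bounds reduce to the elementary inequality $\log t \le t - 1$ with $t = (u+v)/u$.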
Remark 27 ($L^1$ and $L^2$ forms of logarithmic Sobolev inequalities). In continuous settings,
the equivalence between the $L^2$ and $L^1$ forms of the logarithmic Sobolev inequality, given respectively by

\[ \forall f, \quad \mathbf{Ent}_\mu(f^2) \le c\,\mathbf{E}_\mu(\Gamma f) \tag{64} \]

and

\[ \forall f > 0, \quad \mathbf{Ent}_\mu(f) \le \tfrac{1}{4}\,c\,\mathbf{E}_\mu\!\left(\frac{\Gamma f}{f}\right) \tag{65} \]

is a consequence of the chain rule (10) for $\Gamma$, which is itself a consequence of the diffusion property
(11) for the associated infinitesimal generator. In discrete settings, the lack of chain rule destroys this
equivalence, and actually, the simple Poisson measure satisfies the $L^1$ form but not the $L^2$ form, cf. for
example [ABC+00, Chap. 1]. The fact that the semi-group of the simple Poisson point process is not
hypercontractive was noticed by Surgailis eighteen years ago in [Sur84]. The concentration of measure
consequences of the $L^1$ form are only Poisson-like in discrete settings, which is completely logical, cf.
[Led99].
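
The failure of the $L^2$ form for the simple Poisson measure can be observed numerically. The Python sketch below is an illustration under stated assumptions: it takes the Poisson law of parameter 1, the test functions $f_k = \mathrm{I}_{[k,+\infty)}$, the difference operator $Df(x) = f(x+1) - f(x)$, and uses $\mathbf{E}(|Df|^2)$ as energy, ignoring normalising constants. With these choices $\mathbf{Ent}(f_k^2) = -p_k \log p_k$ where $p_k$ is the probability of $[k,+\infty)$, while the energy is the mass of the point $k-1$. The function names are hypothetical.

```python
import math

def poisson_pmf(j, lam=1.0):
    """Mass of the point j under the Poisson law of parameter lam."""
    return math.exp(-lam) * lam ** j / math.factorial(j)

def l2_ratio(k, lam=1.0, tail_terms=80):
    """Ent(f^2) / E(|Df|^2) for f = indicator of [k, +inf) under Poisson(lam)."""
    # Tail probability P(X >= k), truncated after tail_terms terms.
    p = sum(poisson_pmf(j, lam) for j in range(k, k + tail_terms))
    # f^2 = f takes values in {0, 1}, so E(f^2 log f^2) = 0.
    entropy = -p * math.log(p)
    # |Df(x)|^2 equals 1 only at x = k - 1.
    energy = poisson_pmf(k - 1, lam)
    return entropy / energy

ratios = [l2_ratio(k) for k in range(1, 21)]
for k in (1, 5, 10, 20):
    print(k, ratios[k - 1])
```

The ratio keeps growing with $k$ (roughly like $\log k$), so no finite constant $c$ can work in an inequality of the form (64) for this family of test functions.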

Remark 28 (Poissonian logarithmic Sobolev inequality). Notice that the $\Gamma$ operator (cf. (5))
associated to the pure jump Lévy process with generator $L$ given by (52) is:

(r/)(x) ^ / \P)yf{x)\^ dv{y). (66) 

When $\Phi(x) = x\log x$, the Dirichlet forms comparison (63) yields


Entp,(/) < XtVt(^jj-\l9yff dviy)^ 
= XtVt[f-^ jjDyff dviy)^ 


= 21 Pi 



which is the form (64) for measure Pi and with constant 21. 


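Remark 29 (Discussion on (H2) and (H2')). Let $\zeta_1(u,v) = \Phi''(u)v^2$ and $\zeta_2(u,v) = (\Phi'(u+v) - \Phi'(u))v$,
for $(u,v)$ in the domain of $\Psi$. Such functions are non negative since $\Phi$ is convex. One has $\Psi \le \zeta_1$ when
$\Phi'$ is concave, and since

\[ \nabla^2\zeta_1(u,v) = \begin{pmatrix} \Phi''''(u)\,v^2 & 2\Phi'''(u)\,v \\ 2\Phi'''(u)\,v & 2\Phi''(u) \end{pmatrix}, \]

(H1) is equivalent to $\Phi''''\Phi'' \ge 2\Phi'''^2$, which is nothing else than the concavity of $1/\Phi''$ when $\Phi''$ is convex.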

One can check that $\Phi(x) = x\log x$ and $\Phi(x) = x^2$ are in some sense extremal solutions of this O.D.I., up
to affine additions. Similarly, one has $\Psi \le \zeta_2$ when $\Phi$ is convex, and since $\nabla^2\zeta_2(u,v)$ is equal to

\[ \begin{pmatrix} (\Phi'''(u+v) - \Phi'''(u))\,v & \Phi''(u+v) - \Phi''(u) + \Phi'''(u+v)\,v \\ \Phi''(u+v) - \Phi''(u) + \Phi'''(u+v)\,v & 2\Phi''(u+v) + \Phi'''(u+v)\,v \end{pmatrix}, \]

$\zeta_2$ is convex in $u$ but not necessarily in $v$. Finally, let us show why (H2') implies (H2). Assume that
(H2') holds. The function $\Psi$ is non negative since $\Phi$ is convex, and it is convex in each variable $u$ and $v$
when $\Phi$ and $\Phi''$ are both convex. Moreover, it is bivariate convex when $\Phi(x) = x\log x$ or $\Phi(x) = x^2$ as
observed in the pretty paper [Wu00]. Actually, this bi-convexity holds in many more cases since

\[ \nabla^2\Psi(u,v) = \begin{pmatrix} \Phi''(u+v) - \Phi''(u) - \Phi'''(u)\,v & \Phi''(u+v) - \Phi''(u) \\ \Phi''(u+v) - \Phi''(u) & \Phi''(u+v) \end{pmatrix} \]

for which $\mathrm{Tr}\,\nabla^2\Psi(u,v) \ge 0$ when $\Phi$ and $\Phi''$ are convex, and

\[ \mathrm{Det}\,\nabla^2\Psi(u,v) = \Phi''(u)\,(\Phi''(u+v) - \Phi''(u)) - \Phi''(u+v)\,\Phi'''(u)\,v, \]

which is non negative since $\Phi$ and $\Phi''$ are convex and $\Phi'$ is concave. Alternatively, one can use the
Gershgorin-Hadamard Theorem (cf. [HJ90, Sect. 6.1]) to see directly that $\nabla^2\Psi$ is non negative under (H2').
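
The differential criterion $\Phi''''\Phi'' \ge 2\Phi'''^2$ discussed in this remark is easy to test on power functions. For $\Phi(x) = x^p$ the difference $\Phi''''\Phi'' - 2\Phi'''^2$ equals $(p(p-1))^2 (2-p)(p-1)\,x^{2p-6}$, so its sign is governed by $(2-p)(p-1)$. The Python sketch below evaluates this difference directly from the derivatives; the function name `h1_margin` and the chosen values of $p$ are assumptions made for illustration.

```python
def h1_margin(p, x=1.0):
    """Phi''''(x) Phi''(x) - 2 Phi'''(x)^2 for Phi(x) = x**p and x > 0."""
    d2 = p * (p - 1) * x ** (p - 2)                      # Phi''
    d3 = p * (p - 1) * (p - 2) * x ** (p - 3)            # Phi'''
    d4 = p * (p - 1) * (p - 2) * (p - 3) * x ** (p - 4)  # Phi''''
    return d4 * d2 - 2.0 * d3 ** 2

# Hypothetical grid of exponents around the interval (1, 2].
margins = {p: h1_margin(p) for p in (1.25, 1.5, 1.75, 2.0, 2.5, 3.0)}
for p, margin in margins.items():
    print(p, margin)
```

The margin is positive for $p \in (1,2)$, vanishes at $p = 2$ and becomes negative for $p > 2$, in line with the role played by the family $x \mapsto |x|^p$, $p \in (1,2]$.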
6 Links with Boltzmann-Shannon entropy

We assume here that $\mathcal{I} = \mathbb{R}_+$. Let $f$ be a probability density function on $\mathbb{R}^d$ with respect to the
Lebesgue measure. When $\Phi$ is convex, one can define the Shannon $\Phi$-entropy of $f$ by:

\[ \mathbf{H}^\Phi(f) := -\mathbf{Ent}^\Phi(f\,dx \mid dx) := -\mathbf{Ent}^\Phi_{dx}(f) := -\int_{\mathbb{R}^d} \widetilde{\Phi}(f)(x)\,dx, \tag{67} \]

where $\widetilde{\Phi}(u) := \Phi(u) - \Phi(1)\,u$. For any random vector $X$ in $\mathbb{R}^d$ with density $f$ with respect to the Lebesgue
measure, we denote $\mathbf{H}^\Phi(X) := \mathbf{H}^\Phi(f)$. This functional is invariant by translations and hence does not
depend on the mean of $f$. We recover Shannon entropy (or negentropy) $\mathbf{H}$ when $\Phi(x) = x\log x$, cf.
[ABC+00, Chap. 10]. Actually, $\mathbf{S} := -\mathbf{H}$ is also known as Shannon Information in Information Theory
or Boltzmann entropy in Kinetic Gases Theory, see for example [Lie75] and [Vil03]. Similarly, one can
define the Shannon $\Phi$-entropy of a discrete probability measure $p_1\delta_{x_1} + \cdots + p_n\delta_{x_n}$ by

\[ \mathbf{H}^\Phi(p_1\delta_{x_1} + \cdots + p_n\delta_{x_n}) := -\sum_{i=1}^n \Phi(p_i). \tag{68} \]

Since $x \in \mathbb{R}^n \mapsto \Phi(x_1) + \cdots + \Phi(x_n)$ is convex, $\mathbf{H}^\Phi$ is a concave functional on the simplex

\[ \{(p_1, \ldots, p_n) \in \mathbb{R}_+ \times \cdots \times \mathbb{R}_+,\ p_1 + \cdots + p_n = 1\}. \]

At fixed $n$, it achieves its minimum $0$ for Dirac measures, which are the extremal points of the simplex
above, and its maximum $-n\Phi(1/n)$ for the uniform probability measure by convexity. Notice that the
continuous version is not always non-negative.
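
The extremal values of the discrete Shannon $\Phi$-entropy can be illustrated on a small example. The Python sketch below assumes $\Phi(x) = x\log x$ with the convention $\Phi(0) = 0$ and $n = 4$; the function name `discrete_phi_entropy` and the three test distributions are hypothetical choices made for this illustration.

```python
import math

def phi(x):
    """Phi(x) = x log x, with the convention Phi(0) = 0."""
    return 0.0 if x == 0.0 else x * math.log(x)

def discrete_phi_entropy(probabilities):
    """H^Phi(p_1, ..., p_n) = -sum_i Phi(p_i)."""
    return -sum(phi(p) for p in probabilities)

n = 4
dirac = [1.0, 0.0, 0.0, 0.0]          # extremal point of the simplex
uniform = [1.0 / n] * n                # barycentre of the simplex
other = [0.5, 0.25, 0.125, 0.125]      # an arbitrary interior point

h_dirac = discrete_phi_entropy(dirac)
h_uniform = discrete_phi_entropy(uniform)
h_other = discrete_phi_entropy(other)
print(h_dirac, h_other, h_uniform, -n * phi(1.0 / n))
```

For this $\Phi$ the value at the uniform measure is $-n\Phi(1/n) = \log n$, and the arbitrary distribution falls between the two extremes.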

The important sub-additivity of Shannon entropy $\mathbf{H}$ states that for any random vector $(X_1, \ldots, X_n)$
with an absolutely continuous law with respect to the Lebesgue measure:

\[ \mathbf{H}((X_1, \ldots, X_n)) \le \mathbf{H}(X_1) + \cdots + \mathbf{H}(X_n), \tag{69} \]

with equality if and only if $X_1, \ldots, X_n$ are independent. Such a property relies on the non-negativity of
Kullback-Leibler relative entropy and on the basic additivity of the logarithm $\log(ab) = \log a + \log b$:

\[ 0 \le \mathbf{Ent}(\mathcal{L}((X_1, X_2)) \mid \mathcal{L}(X_1) \otimes \mathcal{L}(X_2)) = \mathbf{H}(X_1) + \mathbf{H}(X_2) - \mathbf{H}((X_1, X_2)). \]
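
For Gaussian vectors the sub-additivity gap is explicit. The Python sketch below uses the classical closed forms $\frac{1}{2}\log(2\pi e\,\sigma^2)$ and $\frac{1}{2}\log((2\pi e)^2 \sigma_1^2\sigma_2^2(1-\rho^2))$ for the Shannon entropy $-\int f\log f$ of one-dimensional and two-dimensional Gaussian laws, so that $\mathbf{H}(X_1) + \mathbf{H}(X_2) - \mathbf{H}((X_1,X_2)) = -\frac{1}{2}\log(1-\rho^2)$. The function names and parameter values are assumptions made for illustration.

```python
import math

def gaussian_entropy_1d(variance):
    """Shannon entropy of a one-dimensional Gaussian law with given variance."""
    return 0.5 * math.log(2.0 * math.pi * math.e * variance)

def gaussian_entropy_2d(var1, var2, rho):
    """Shannon entropy of a bivariate Gaussian law with correlation rho, |rho| < 1."""
    det = var1 * var2 * (1.0 - rho * rho)  # determinant of the covariance matrix
    return 0.5 * math.log((2.0 * math.pi * math.e) ** 2 * det)

def subadditivity_gap(var1, var2, rho):
    """H(X1) + H(X2) - H((X1, X2)); it should equal -0.5 * log(1 - rho^2)."""
    return (
        gaussian_entropy_1d(var1)
        + gaussian_entropy_1d(var2)
        - gaussian_entropy_2d(var1, var2, rho)
    )

for rho in (0.0, 0.5, 0.9):
    print(rho, subadditivity_gap(2.0, 3.0, rho), -0.5 * math.log(1.0 - rho * rho))
```

The gap vanishes exactly when $\rho = 0$, that is when the two Gaussian coordinates are independent, and it does not depend on the variances.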


Notice that this sub-additivity is different from the one related to Kullback-Leibler relative entropy (40)
since Shannon entropy is opposite in sign and based on the Lebesgue measure which is not a probability
measure. This fact was explained in [ABC+00, Chap. 10]. Actually, such a sub-additivity property is not
related to the Lebesgue measure, as noticed in [Cha03], and one can show that for any positive measure
$\mu = \mu_1 \times \cdots \times \mu_n$ on a product space and any non-negative real valued integrable function $f$:

\[ \mathbf{Ent}_\mu(f) \ge \mathbf{Ent}_{\mu_1}\left(\int f\,d\mu_{\setminus 1}\right) + \cdots + \mathbf{Ent}_{\mu_n}\left(\int f\,d\mu_{\setminus n}\right) \tag{70} \]

where $\mu_{\setminus i} := \mu_1 \times \cdots \times \mu_{i-1} \times \mu_{i+1} \times \cdots \times \mu_n$, with equality if and only if $f$ is a tensor product function.
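Beware that (40) and (70) are opposite. We do not know whether the sub-additivity property (69) of Shannon
entropy can be generalised to any convex $\Phi$. Actually, the functional $f \mapsto \mathbf{H}^\Phi(f)$ is concave and formally, the
Fréchet derivatives are given by:

\[ (D\mathbf{H}^\Phi)(f)(h) = -\int_{\mathbb{R}^d} \widetilde{\Phi}'(f)\,h\,dx \quad\text{and}\quad (D^2\mathbf{H}^\Phi)(f)(h,h) = -\int_{\mathbb{R}^d} \widetilde{\Phi}''(f)\,h^2\,dx. \]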

Therefore, the Shannon like $\Phi$-entropy $\mathbf{H}^\Phi$ achieves its maximum under the linear constraint
$\mathbf{E}(W(X)) = \int_{\mathbb{R}^d} W(x) f(x)\,dx = c$ for probability density functions of the form:

\[ f_W := (\widetilde{\Phi}')^{-1}(-\alpha - \beta W), \]

where $(\alpha, \beta) \in \mathbb{R}^2$ is chosen in such a way that the constraint is fulfilled and that $f_W$ is a probability
density function with respect to the Lebesgue measure. Notice that $(\widetilde{\Phi}')^{-1}$ is the derivative of the Young
conjugate of $\widetilde{\Phi}$. The Gaussian maximum of Shannon entropy $\mathbf{H}$ at fixed covariance appears as a particular
case, for which $\Phi(x) = x\log x$, $W(x) = |x|^2$, $(\widetilde{\Phi}')^{-1}(y) = \exp(y - 1)$, $\beta = d/(2c)$ and $\alpha = Z_{c,d}$ is the
Gaussian normalising constant. More generally, we recover as a particular case the famous “principle of
maximum entropy” which states that Boltzmann-Shannon entropy is maximised under linear constraint
by Boltzmann-Gibbs measures. It is tempting and quite natural to ask if $\mathbf{H}^\Phi$ shares more common
properties with Shannon entropy $\mathbf{H}$. As we have seen, the convex conjugate functional will play a
role. This question is partly answered in [TV93], [BTT86], [BR82a] and [BR82c] and references therein.
Entropy like measures of information are still actively explored, and one can find recent results in [Dra04]
and in Flemming Topsøe's papers for example.
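
The Gaussian case can be checked by quadrature in dimension $d = 1$. In the Python sketch below the density is $\exp(-\alpha - \beta x^2 - 1)$ with $\beta = 1/(2c)$, and $\alpha$ is computed from the normalisation requirement as $\frac{1}{2}\log(\pi/\beta) - 1$; this explicit value of $\alpha$, the midpoint rule, the integration window and the value $c = 2$ are assumptions made for this illustration.

```python
import math

def max_entropy_density(x, c):
    """exp(-alpha - beta x^2 - 1) in dimension d = 1, with W(x) = x^2."""
    beta = 1.0 / (2.0 * c)                         # beta = d / (2c) with d = 1
    alpha = 0.5 * math.log(math.pi / beta) - 1.0   # chosen so that the total mass is 1
    return math.exp(-alpha - beta * x * x - 1.0)

def integrate(g, lo=-30.0, hi=30.0, steps=20000):
    """Midpoint rule on [lo, hi]; the window is wide enough for the tails used here."""
    h = (hi - lo) / steps
    return h * sum(g(lo + (i + 0.5) * h) for i in range(steps))

c = 2.0
mass = integrate(lambda x: max_entropy_density(x, c))
second_moment = integrate(lambda x: x * x * max_entropy_density(x, c))
print(mass, second_moment)
```

Up to quadrature error, the total mass is 1 and the constraint $\mathbf{E}(W(X)) = c$ is met, which is consistent with the choice $\beta = d/(2c)$.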

Final words. One may retain that the classical relative entropy $\mathbf{Ent}(\nu \mid \mu)$ where $\mu$ and $\nu$ are
positive Borel measures possesses a lot of properties coming from the very particular base function
$\Phi(x) = \theta(x) := x\log x$:

• $\theta$ is strictly convex, and thus $\mathbf{Ent}_\mu(f) = 0$ iff $f$ is constant $\mu$-a.s.;

• $\theta(0) = \theta(1) = 0$ and thus $\mathbf{H}(X)$ vanishes when $X$ is constant;

• $1/\theta''$ is affine and hence concave and thus $f \mapsto \mathbf{Ent}_\mu(f)$ is convex;

• $\theta(ab) = b\,\theta(a) + a\,\theta(b)$ and thus $f \mapsto \mathbf{Ent}_\mu(f)$ is 1-homogeneous;

• The Young conjugate $\theta^*(u) = e^{u-1}$ is monotone and $(\theta^*)' = \theta^*$.

Recall that $\theta^*(u) := \int_{-\infty}^{u} (\theta')^{-1}(x)\,dx$. Some of these properties are well imitated by $x \mapsto |x|^p$ with $p \in (1,2]$,
which is exactly the family of simple power convex functions between $x \mapsto \theta(x)$ and $x \mapsto x^2$, and the
latter is for some aspects the “simplest” one. Some results involving $\mathbf{Ent}_\mu$ rely only on a few properties
of $\theta$ whereas other ones rely on all of them, and this fact obviously puts some limits on the possible
generalisations.
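
Two of the properties listed above lend themselves to a quick numerical check: the product rule satisfied by $x \mapsto x\log x$ and the closed form $e^{u-1}$ of its Young conjugate $u \mapsto \sup_{x > 0}(ux - x\log x)$. The Python sketch below approximates the supremum on a grid; the grid bounds, the number of points, the sample values and the function names are assumptions made for this illustration.

```python
import math

def theta(x):
    """theta(x) = x log x for x > 0."""
    return x * math.log(x)

def conjugate_on_grid(u, lo=1e-6, hi=10.0, steps=100000):
    """Grid approximation of sup over x in [lo, hi] of u*x - theta(x)."""
    h = (hi - lo) / steps
    return max(u * x - theta(x) for x in (lo + i * h for i in range(steps + 1)))

# Product rule: theta(ab) = b theta(a) + a theta(b).
a, b = 1.7, 0.3
product_rule_gap = theta(a * b) - (b * theta(a) + a * theta(b))

# Young conjugate compared with exp(u - 1); the maximiser exp(u - 1) lies in the grid.
conjugate_gaps = [conjugate_on_grid(u) - math.exp(u - 1.0) for u in (0.0, 1.0, 2.0)]
print(product_rule_gap, conjugate_gaps)
```

The grid value can only underestimate the supremum, so the conjugate gaps are slightly negative and shrink as the grid is refined.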